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TARGETING GENE TRANSFER VECTORS TO 
CERTAIN CELL TYPES BY PSEUDOTYPING WITH 
VIRAL GLYCOPROTEIN 

FIELD OF THE INVENTION 

5 The present invention relates generally to compositions and methods for 

selective gene transfer, and in particular, to methods for targeting genes to certain ceil 
types, comprising introducing to a cell population the gene to be transferred 
operatively-linked to an appropriate transfer vehicle, wherein the transfer vehicle is 
associated with a transmembrane form of viral glycoprotein. 

1 0 BACKGROUND OF THE INVENTION 

Ebola virus has been identified as the cause of several highly lethal outbreaks 
of hemorrhagic fever. Infection begins typically with flu-like symptoms which often 
progress rapidly to fatal complications of hemorrhage, fever, and hypotensive shock. 
Bowen, E.T.W. et al., Lancef 1:571 (1977); Centers for Disease Control, MMW.R. 

15 44:381 (1995); Le Guenno, B. et al., Lancet 345:1271 (1995); Peters, CJ. et al., 
Fields Virology, B.N. Fields, D.M. Knipe and P.M. Howley, Eds. (Lippincott-Raven, 
Philadelphia) p. 1 161 (1996). The negative-stranded genome of Ebola virus contains 
seven structural and regulatory proteins (Sanchez, A. et al., Virus Res, 29:215 
(1993)), but despite its relative simplicity, the molecular basis for Ebola virus 

20 pathogenicity is unknown. Among the viral gene products, the glycoprotein is found 
in two forms: a secreted form, 50-70 kD (Sanchez, A. et at., PNAS (USA) 93:3602 
(1996)), synthesized at high levels early in the course of infection, and an alternative 
transmembrane form, which arises from RNA editing to encode a 120-150 kD 
glycoprotein that is incorporated into the virion. Sanchez, A. et al., PNAS (USA) 

25 93:3602 (1996); Volchkov, V.E, et al, Virology 214:421 (1995). The first 295 amino 
acids (aa) of both proteins are identical in the Zaire strain, while sGP contains an 
additional 69 and GP another 381 COOH- terminal aa residues. Sanchez, A. et al., 
PNAS (USA) 93:3602 (1996). The specific cellular targets of these related gene 
products and their roles in the pathogenesis of Ebola infection have not been 
30 characterized. 

SUMMARY OF THE INVENTION 

The present invention provides compositions and methods for targeting gene 
transfer vectors to certain cell types by pseudotyping with a transmembrane form of 
viral glycoprotein. In one embodiment, the methods of the invention comprise the 
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step of administering to a cell population a gene to be transferred operatively-linked 
to an appropriate transfer vehicle, wherein the transfer vehicle is associated with a 
transmembrane form of Ebola glycoprotein. In this embodiment, the gene will be 
targeted to ceil types naturally infected with Ebola such as endothelial cells, 
5 monocytes and hepatocytes. 

Genetic constructs for selective gene transfer into certain cell types are also 
provided. The genetic constructs of the present invention comprise a gene to be 
transferred operatively-linked to an appropriate transfer vehicle or carrier, wherein the 
transfe vehicle or carrier is associated with a transmembrane form of viral 
10 glycoprotein. In one embodiment, the transmembrane form of Ebola glycoprotein is 
expressed on the surface of a virus-based gene-targeting vector, e.g., lentiviral or 
retroviral vector. In another embodiment, an expressed or synthesized 
transmembrane glycoprotein is chemically derivatized to a non-biologic gene targeting 
vehicle. 

15 Additional objects, advantages, and features of the present invention will 

become apparent from the following description and appended claims, taken in 
conjunction with the accompanying drawings. 

BRIEF DESCRIPTION OF THE DRAWINGS 
The various advantages of the present invention will become apparent to one 
20 skilled in the art by reading the following specification and subjoined claims and by 
referencing the following drawings. 

Figures 1 A-1C show the binding of sGP to neutrophils; 
Figures 2A-2D show the infection of different cell types by a GP-pseudotyped 
vector of the present invention; 
25 Figures 3A-3F show the dependence of sGP binding on CD1 6b and correlation 

of binding with neutrophil activation; 

Figures 4A-4B show the effect of sGP on neutrophil function; 
Figures 5A-5C show the infection rate of cells with a GP-pseudotyped retroviral 
vector of the present invention; 
30 Figure 6 is a schematic of the piasmid pVR 1012-GP(IC) (Ivory Coast strain 

of GP f see SEQ ID NO: 1); 

Figure 7 is a schematic of the piasmid pVR 1012-GP(S) (Sudan strain of GP, 
see SEQ ID NO: 2); 

Figure 8 is a schematic of the piasmid pVR 1012-GP(Z) (Zaire strain of GP, 
35 see SEQ ID NO: 3); 
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Figure 9 is a schematic of the plasmid pVR 1012-sGP(Z) (Zaire strain of sGP, 
see SEQ ID NO: 4); and 

Figure 10 is a summary of the characterization of GP and sGP derivatives for 
their ability to pseudotype to induce cytotoxicity in producer cells. 
5 DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS 

The present invention provides genetic constructs and methods for targeting 
gene transfer vectors to certain cell types by pseudotyping with a transmembrane 
form of viral glycoprotein. The methods for selective gene transfer of the present 
invention comprise the step of administering to a cell population a genetic construct 
10 of the present invention so that the gene is transferred and expressed in certain cell 
types present in the cell population. Administration to the cell population may be ex 
vivo or in vivo. 

The genetic constructs of the present invention comprise a gene to be 
transferred operatively-linked to an appropriate transfer vehicle or carrier, wherein the 

15 transfer vehicle or carrier is associated with a transmembrane form of viral 
glycoprotein. In one embodiment, the transmembrane form of Ebola glycoprotein is 
associated with the vehicle or carrier. The gene to be transferred will thus be 
targeted to cell types naturally infected with Ebola virus including endothelial cells, 
hepatocytes, monocytes and related cell types such as dendritic cells. The 

20 transmembrane form of Ebola glycoprotein may be chosen from, without limitation, the 
Ivory Coast strain (SEQ ID NO: 1), Sudan strain (SEQ ID NO: 2), the Zaire strain 
(SEQ ID NO: 3) and/or the Reston strain. It will be appreciated that in other 
embodiments of the present invention, other hemorrhagic fever virus glycoproteins, 
in particular transmembrane glycoproteins, may be employed and will target those cell 

25 types naturally infected by the virus. Examples of hemorrhagic viruses include 
dengue virus, Yellow Fever virus {flaviviridae): Lassa, Junin and Machupo 
{arenaviridae)\ Rift Valley, Congo-Crimean and Hantaan (bunyaviridae); and Marburg 
(filoviridae). It will also be appreciated that derivatives of the transmembrane 
glycoprotein which retain the capability of targeting specific cell types, may also be 

30 employed, for example, the transmembrane glycoproteins may be mutated, e.g., toxic 
regions may be removed to improve producer cell viability (see Figure 10), 

The transmembrane glycoprotein may be expressed on the surface of a virus- 
based gene-targeting vector, e.g., tentivtral, retroviral, replication-deficient retroviral, 
adenoviral or adeno-associated viral vector. The transmembrane glycoprotein may 
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also be expressed or synthesized and chemically derivatized to a non-biologic gene 
targeting vehicle, e.g., liposome or DNA-protein complex. 

The term "operatively-linked" as used herein refers to functional linkage 
between a nucleic acid expression control sequence (such as a promoter) and a 
5 second nucleic acid sequence (i.e., gene), wherein the expression control sequence 
directs transcription of the nucleic acid corresponding to the second sequence- 
Expression control sequences are known to those skilled in the art (see, e.g., 
Goeddel, Gene Expression Technology: Methods in Enzymology 185, Academic 
Press, San Diego, CA (1990)). "Associated with" as used herein refers to the 
1 0 transmembrane form of viral glycoprotein being in contact or linkage with the transfer 
vehicle or carrier in such a way as to direct the transfer vehicle or carrier to certain 
cell types. The terms "transfer vehicle" and "carrier" refer to any type of structure 
which is capable of delivering the gene of interest to a target cell. 

Many transfer vehicles or carriers are known in the art. For example, various 
15 viruses that are capable of infecting cells can be recombinant^ manipulated to carry 
the gene of interest without affecting their infectivity. As used herein, the terms 
"infect" and "infectivity" refer only to the ability of a virus to transfer genetic materia! 
to a target cell. Those terms do not mean that the virus is capable of replication in 
the target cell. In fact, it is preferable that such viruses are replication defective so 
20 that target cells do not suffer the effects of viral replication. 

in one embodiment, the virus employed is a replication defective retroviruses. 
When these replication defective retroviruses are employed, their genomes can be 
packaged by a helper virus in accordance with well-known techniques. Suitable 
retroviruses include PLJ, pZip, pWe and pEM, each of which is well known in the art. 
25 Suitable helper viruses for packaging genomes include ^Crip, y/Cre, tp2, yAm and 
adeno-associated viruses. 

In another embodiment, lentiviral vectors are employed. Surprisingly, the 
inventors of the present invention were successful in pseudotyping lentiviral vectors 
(HIV) with the transmembrane glycoprotein from Ebola. Feline immunodeficiency 
30 virus, bovine immunodeficiency virus, simian immunodeficiency virus and EAIV, may 
also be employed as the carrier in the compositions and methods of the present 
invention. 

Gene delivery systems other than viruses may also be employed. For 
example, the gene to be transferred may be packaged in a liposome which is 
35 chemically derivatized to the transmembrane glycoprotein. To form these liposomes, 
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one mixes the DNA of an expression vector which expresses the gene to be 
transferred with lipid, such as ^-[^(Z^^dioleyloxyJpropyi^^N./V-trimethylammonium 
chloride (DOTMA) in a suitable buffer, such as Hepes buffered saline. This causes 
the spontaneous formation of lipid-DNA complexes (liposomes). Feigner, P.L et al., 
PNAS (USA) 84:7413-7417 (1987). 

Another gene delivery system that may be utilized in this invention is DNA- 
protein complexes. The formation of DNA-protein complexes is described in United 
States Patent No. 5,166,320, the disclosure of which is herein incorporated by 
reference. 

It will be appreciated that any gene may be employed in the compositions and 
methods of the present invention. For example, and without limitation, in the 
treatment of cancer, death inducing genes, including genes coding for cytostatic or 
cytotoxic proteins, e.g., HSV tk, and genes encoding cyclin dependent kinase 
inhibitors, p21, p27, cytosine deaminase, and fas ligand, may ail be employed. In 
another example, for the treatment of cardiovascular or ischemic vascular disease, 
genes encoding angiogenic factors such as VEGF basic or acidic FGF's (FGF 1-5) 
may be employed. In yet another example, in the treatment of vasospasm, the gene 
encoding NO synthase or heme oxygenase, may be employed. In a further example, 
monocytes and dendritic cells may be targeted with genes encoding immunogens for 
cell-targeted immunization. 

In one embodiment, the methods of targeting gene transfer vectors to certain 
cell types involve administering to a cell population ex vivo, a construct of the present 
invention and introducing the transfected ceils into a subject. In an alternative 
embodiment, the methods of the present invention comprise administering to an in 
vivo cell population a construct of the present invention. Administration can be by any 
of the routes normally used for in vivo gene therapy such as direct delivery to cells 
via a gene gun, and other known techniques. The constructs are thus administered 
in any suitable manner, preferably with pharmaceutically acceptable carriers. The 
constructs can be administered, for example, by intravenous infusion, orally, topically, 
intraperitoneally, intravesically or intrathecally. The preferred method of 
administration will often be intravenous. 

To practice an ex vivo method of the present invention, a source of cells is 
obtained. The cells are optionally selected from in vitro cells, such as those derived 
from cell culture and ex vivo cells, such as those derived from a subject. The term 
"subject" is intended to include living organisms, e.g., mammals. Examples of 



WO 99/37331 PCT/US99/01382 

- 6 - 

subjects include humans, primates, dogs, cats, mice, rats, and transgenic species 
thereof, ft will be appreciated that specific cell populations may be obtained by 
isolation from certain tissues by methods known to those skilled in the art. The ceils 
are maintained under conditions necessary to support growth, for example an 
appropriate temperature (e.g., 37°C) and atmosphere (e.g., air plus 5% C0 2 ). 

The cells are then transfected with the constructs of the present invention by 
introducing the constructs to the cell population, under conditions favorable for 
transfection. According to one embodiment of the present invention, cells are treated 
with compounds that facilitate uptake of the constructs by the cells. According to 
another embodiment of the present invention, cells are treated with compounds that 
stimulate cell division and facilitate uptake of the constructs. It will be appreciated 
that compounds that facilitate uptake of constructs by cells and compounds that 
stimulate cell division are known to those skilled in the art. 

The constructs of the present invention express the transferred gene in a dose 
dependent manner. The specific dose to be administered to a patient will be 
determined by the efficacy of the particular construct and/or delivery system 
employed, the gene transferred, and the condition of the patient, as well as the body 
weight or surface area of the patient to be treated. The size of the dose also wilt be 
determined by the existence, nature, and extent of any adverse side-effects that 
accompany the administration of a particular construct or effect a particular patient, 
in determining the effective amount of the construct or transfected cell to be 
administered, the physician needs to evaluate circulating plasma levels, toxicities, and 
progression of disease. It will be appreciated that administration can be accomplished 
via single or divided doses. 

There is a wide variety of suitable formulations for pharmaceutical 
compositions containing the constructs of the present invention. Formulations suitable 
for oral administration can consist of (a) liquid solutions, such as an effective amount 
of the construct dissolved in diluents, such as water, saline or PEG 400; (b) capsules, 
sachets or tablets, each containing a predetermined amount of the construct, as 
liquids, solids, granules or gelatin; (c) suspensions in an appropriate liquid; and (d) 
suitable emulsions. The construct, alone or in combination with other suitable 
components, may also be made into aerosol formulations to be administered via 
inhalation, e.g., to the bronchial passageways. Aerosol formulations can be placed 
into pressurized acceptable propellants, such as dichlorodifluoromethane, propane, 
nitrogen, and the like. 
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Suitable formulations for rectal administration include, for example, 
suppositories, which consist of the construct with a suppository base. Suitable 
suppository bases include natural or synthetic triglycerides or paraffin hydrocarbons. 
In addition, it is also possible to use gelatin rectal capsules which consist of a 
5 combination of the construct with a base, including, for example, liquid triglycerides, 
polyethylene glycols, and paraffin hydrocarbons. 

Formulations suitable for parenteral administration, such as, for example, by 
intraarticular (in the joints), intravenous, intramuscular, intradermal, intraperitoneal, 
and subcutaneous routes, include aqueous and non-aqueous, isotonic sterile injection 

10 solutions, which contain antioxidants, buffers, bacteriostats, and solutes that render 
the formulation isotonic with the blood of the intended recipient, and aqueous and 
non-aqueous sterile suspensions that can include suspending agents, solubilizers, 
thickening agents, stabilizers, and preservatives. The formulations can be presented 
in unit-dose or multi-dose sealed containers, such as ampules or vials. 

15 Extemporaneous injection solutions and suspensions can be prepared from sterile 
powders, granules, and tablets of the kind previously described. Cells transfected by 
the constructs as described above in the context of ex vivo therapy can also be 
administered as described above. 

This invention also provides compositions and kits comprising the constructs 

20 of the present invention. For example, the composition can comprise the constructs 
of the present invention in a pharmaceutical^ acceptable carrier as described above. 
Kits comprising such compositions and instructions for use are also within the scope 
of this invention. 

In order to more fully demonstrate the advantages arising from the present 
25 invention, the following examples are set forth. It is to be understood that the 
following is by way of example only and is not intended as a limitation on the scope 
of the invention. 

SPECIFIC EXAMPLE 1 

I. Methods 

30 Recombinant retroviruses were produced by transient transfection of 293T 

cells: 2 x 10 6 cells were plated 24 hours before transfection in 60 mm dishes. 
Transfection was performed by calcium-phosphate precipitation using 3 //g of a 
retroviral vector (Kinsella, T.M. et al., Hum. Gene Ther. 7:1405 (1996)) encoding 
lucerifase linked to an internal ribosome entry site and a green fluorescent protein 

35 derivative (GFP; pEGFP, Clontech), pLZRs-Luc-Gfp, 5 pg of an expression vector 
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encoding gag and pol, pNGVL-MLVgag-pol, and 1 jjg of the envelope encoding 
plasmid: pNGVL-4070A (ampho) env, pCMV-Eco env or p1012-Eboia GP, 
respectively. Supernatants corresponding to 24-48 hours post-transfection were 
harvested, cleared by low-speed centrifugation and either used immediately for 
5 infection or frozen at -80°C. Infections were performed in 6-well plates (1 .5-2.5 x 10 5 
adherent cells) or 12-well plates (5 x 10 5 non-adherent) using different dilutions of the 
supernatants by incubating the cells overnight with 1 ml and 300 //I, respectively of 
the diluted supernatants. Polybrene was used at a concentration of 5 //g/ml for all the 
cell lines except for D17 in which the concentration was 100 f/g/ml. After overnight 

10 infection, fresh medium was added and the cells were incubated for an additional 24 
hours. After infection, the cells were lysed in 25 mM Tris-phosphate pH 8, 2 mM 
DTT, 2 mM 1,2-diaminocyciohene-N,N,N\N -tetraacetic acid, 10% glycerol, 1% 
TritonX-100, and assayed for luciferase activity using Luciferase Assay Reagent 
(Promega, Madison, Wl) in a 1251 BioOrbit Luminometer. The same number of cells 

15 (range 5-10 x 10 4 ) was analyzed for every specific cell line. 

Binding of sGP to neutrophils and inverse correlation of binding with 
activation: Figures 1A-1A2. PBMC from normal volunteers were incubated with 
control or sGP supernatants derived from transfected 293 cells, and immunostaining 
was performed using a rabbit antibody to sGP as previously described. Sanchez, A. 

20 et a!., PNAS (USA) 93:3602 (1996); Xu, L. et al., Nat Med. (1997) in press. 
Secondary staining was performed with a fluorescein isothiocyanate (FITC)- 
conjugated goat anti-rabbit IgG antibody (Sigma, F9887). All incubations were 
performed at 4°C for 30 minutes with .4 jjg of the relevant antibodies per 10 6 cells in 
a 50 jj\ volume. 

25 Figures 1B-1B1. Double immunostaining with antibodies to sGP and the 

neutrophil-specific marker, CD15. Cells were incubated with a FITC conjugated 
mouse anti-human CD15 antibody (Caltag, cat# MHCD1501), followed by secondary 
staining with a PE-conjugated anti-rabbit IgG antibody (Sigma) to detect sGP binding. 
Ceils were washed with PBS, fixed in 1% formaldehyde, and analyzed by FACS. 

30 Figure 1C. Specific absorption of sGP by neutrophils. Control or sGP 

supernatants derived from relevant transfected 293 cells (Xu, L. et al, Nat Med. 
(1 997) in press) were incubated at 1 :500 dilution with 1 0 6 mononuclear or granulocytic 
cells. Cells were removed and the resulting supernatants analyzed by an 8% SDS 
PAGE gel. Western blot analysis was performed as previously described (Xu, L. et 

35 al., Nat Med. (1997) in press) using an anti-GP rabbit antisera and a secondary 



WO 99/37331 PCT/US99/01382 

- 9 - 

antibody, horseradish peroxidase conjugated donkey anti-rabbit IgG at a dilution of 
1 :5,000 (Amersham, NA934). Primary antibody was incubated for 30 minutes at room 
temperature, as was the secondary antibody. The immunocomplexes were detected 
by chemiluminescence using Supersignai® chemiiuminescent substrate reagents 
5 (Pierce) according to the manufacturer's instructions. Arrow indicates sGP reactive 
band. 

Infection of different cell types by GP-pseudotyped retroviral vector and 
preferential binding to endothelial cells: Figure 2A. Infection of different indicator 
cell lines with the Ebola-GP pseudotyped retrovirus expressing iuciferase. 

10 Amphotropic and ecotropic retroviral vectors were used as controls. Viruses were 
diluted to different multiplicities of infection (MOI) to provide for equal iuciferase 
activity on Heia cervical epithelial cells, permissive for amphotropic retrovirus, D17 
dog osteosarcoma cells (Embretson, J.E. et al. f J. Virol. 61:3454 (1987)), which are 
permissive for amphotropic, xenotropic, and ecotropic retroviruses, and BW5147 T 

15 leukemia cells permissive for amphotropic and ecotropic virus. In these groups, GP 
virus titer was 1-4x10 5 /ml and amphotropic virus was ~2x10 4 /ml (MOI's«1.0 and 0.1, 
respectively), and the ecotropic virus titer was ~10 6 /mi (MOI=*10). Titers were 
determined by endpoint dilution of reporter activity of the amphotropic virus in D17 
cells, and was normalized to reverse transcriptase activity for the GP virus. 

20 Figure 2B. Analysis of different normal or transformed cell lines by infection 

with amphotropic or GP retroviral vectors at the same titer (10 4 /ml, MOI *0.2). Forty- 
eight hours after infection, an equivalent of 5 x 10 4 cells was assayed for Iuciferase 
activity after exposure to equal titers of viral stocks. Luminescence is expressed as 
the fold-increase over non-infected control cells. 

25 Figures 2C-2C3. The binding of sGP (left) or GP-pseudotyped retrovirus 

(right) to neutrophils (upper panel) or microvascular endothelium (lower panel) was 
determined by FACS. sGP binding was performed as in Fig. 1A, and retrovirus 
incubation was performed at 37°C for 2 hours in the presence of polybrene (8 j/g/ml). 
Figure 2D. Infection of D17 cells by GP-pseudotyped virus in the absence 

30 (lane 1 , none) or presence of control (lane 2) or sGP supernatant (lane 3) from 
transfected 293 cells. Gene transfer was measured by the Iuciferase assay as 
described below. Luminescence refers to relative light units in the Iuciferase assay. 

Dependence of sGP binding on CD16b and correlation of binding with 
neutrophil activation: Figures 3A-3D. Neutrophils were incubated for 30 minutes 

35 at 4°C with a mouse antibody to CD16b (upper panel; clone 3G8 from Immunotech, 
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cat# 1M0813) or CD62L (middle panel, R&D Systems), compared to the indicated 
control antibody [purified mouse IgG (Vector Laboratories), I-2000], followed by 
supernatants from control or sGP-transfected 293 cells, primary rabbit antibody to 
sGP, and a FITC-conjugated secondary antibody to rabbit IgG (Fig. 1 , legend). Cells 
5 were washed with PBS, fixed in 1% formaldehyde, and analyzed by FACS. For 
blocking, 10 6 ceils were incubated with 0.5 - 1 /jg of the relevant antibodies for 30 
minutes in a 50 //I volume. 

Figures 3E-3F. Immunostaining with sGP was performed on isolated 
neutrophils which were maintained in media (none) or incubated with PMA (10 ng/ml) 

10 at 37°C for 30 minutes (PMA). 

Effect of sGP on neutrophil function: Figures 4A-4B. Exposure of 
neutrophils to sGP inhibits down modulation of L-selectin. Isolated neutrophils were 
incubated with the indicated control or sGP containing supernatants (Xu, L. et al., Nat. 
Med. (1997) in press) and defined media (AIM V, GIBCO) for 4 hours at 37°C. 

15 Expression of L-selectin was determined using an anti-CD62L antibody (R&D 
Systems), followed by the secondary staining using a FITC-conjugated anti-mouse 
IgG (Sigma, F2883) as described in Fig. 1, legend. Cells were washed with PBS, 
fixed with 1 % formaldehyde and analyzed by FACS for relative levels of fluorescence 
intensity as a function of cell number. An isotype control was used to quantitate 

20 background levels of immunostaining (neg.). Results are representative of three 
independent experiments. 
11. Results 

To determine the specificity of Ebola virus glycoproteins, expression vectors 
encoding either sGP, GP, or a plasmid control (Xu, L. et al., Nat. Med. (1997) in 

25 press) were transfected into 293 cells, and cell culture supernatants were used as a 
source of relevant recombinant glycoproteins. Binding of sGP was determined by 
immunofluorescence analysis after incubation of relevant supernatants with normal 
or transformed human cell lines. No binding was detected to several hematopoietic 
lineages, including lymphocytes or monocytes (Fig. 1A), or transformed JurkatorCEM 

30 T leukemias, the HL60 myelomonocytic or U937 promonocytic leukemia cells. In 
contrast, sGP was able to bind to granulocytic cells, as evidenced by FACS analysis 
of this subset of peripheral blood mononuclear cells (PBMC) discriminated by cell size 
and granularity (Fig. 1 A). This cell specificity was confirmed by using double-staining 
with a granulocyte-specific cell surface marker, CD15 (Fig. 1B). Absorption of sGP 
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by purified neutrophils in the absence of antibodies also resulted in depletion of sGP, 
indicating that binding to the neutrophil occurred in the absence of antibody (Fig. 1 C). 

A potential structural similarity between Ebola GP and avian sarcoma virus 
envelope protein has been previously proposed (Gallaher, W.R., Cell 85:477 (1996)), 
5 raising the possibility that this protein could be incorporated into retroviral particles. 
To determine the binding specificity of the transmembrane glycoprotein, pseudotyping 
of a Moloney leukemia virus was therefore attempted. Infectivity of different cell types 
by this pseudotyped vector was determined with a luciferase reporter gene, de Wet, 
J.R. et al. t MoL Cell. Biol. 7:725 (1987). This analysis revealed infection of cells 

10 different from those which interacted with sGP (Fig. 2A,B). For example, though it 
could infect other cell types, transduction by the GP retroviral vector readily occurred 
in endothelial cells, either from the microvasculature (MVEC) or umbilical veins 
(HUVEC) (Fig. 2B), which did not bind sGP (Fig. 2C, left). When the specificity of 
GP-retrovirus was compared to murine retroviruses pseudotyped with amphotropic or 

15 ecotropic envelope gp70 proteins, the range of susceptible target cells differed (Fig. 
2B), suggesting that the virus receptor(s) for Ebola GP differ from those previously 
described forgp70. Minimal binding of GP-virus was observed on neutrophils, despite 
the ability of these cells to bind sGP (Fig. 2C, upper panel) and the fact that 
immunoreactive protein was detected on the virus. Conversely, GP-virus binding to 

20 endothelial cells was readily detected, though these cells did not bind sGP (Fig. 2C, 
lower panel). When sGP was analyzed for its effect on GP retroviral gene transfer, 
infection was not inhibited by sGP (Fig. 2D) r further confirming the divergent 
specificities of the two forms of the viral glycoprotein. Recent studies have revealed 
that the biochemical forms of these proteins differ, with sGP present in solution 

25 primarily as a homodimer and GP as a trimer, suggesting that differences in multimer 
composition may contribute to these alternative specificities. 

Potential cell surface receptors for sGP were analyzed with antibodies to 
several neutrophil cell surface antigens to interfere with sGP binding, including CD1 5, 
L-selectin (CD62L), CD 16b, and several common leukocyte antigens. Only the 

30 neutrophil-specific form of the low affinity F c preceptor III, CD16b, inhibited sGP 
binding specifically. Antibodies to CD62L, for example, did not inhibit sGP binding 
(Fig. 3). Binding to neutrophils correlated with their activation state and CD 16b 
expression since no binding was observed in cells stimulated with phorbol 12- 
myristate 13-acetate (PMA) for 30 minutes, at which time CD16b expression was 

35 markedly decreased on these cells (Fig. 3, lower panel). Overexpression of this form 
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of CD16 on a heterologous cell type, 3T3 fibroblasts, did not confer sGP binding to 
these cells by FACS analysis, suggesting that CD16b is necessary but not sufficient 
for stable binding. 

Binding of sGP did not inhibit neutrophil activation in response to potent 
5 pleiotropic activators (PMA, IL-8, or f-Met-Leu-Phe), as measured by down modulation 
of L-selectin expression using FACS analysis, in a defined serum-free medium, 
partial activation of neutrophils was observed, with a decrease in L-selectin 
expression at 4 hours (Fig. 4). Under these conditions, incubation of neutrophils with 
sGP supernatant prevented this decrease in L-selectin expression (Fig. 4). Because 

10 L-selectin was not required for sGP binding (Fig. 3), this effect was apparently 
indirect, through a mechanism not yet defined, possibly involving CD16b or 
carbohydrate interactions of the highly glycosylated sGP protein. 

The expression of alternative Ebola virus glycoproteins in clinical infection has 
long been recognized, but their functional roles and cell specificity have not been 

15 defined. Early after infection, high levels of the secreted protein are found in the 
serum and precede fulminant replication and dissemination of virus systemically, at 
which time synthesis of transmembrane GP is markedly increased, Sanchez, A. et 
al., PNAS (USA) 93:3602 (1996). The inventors have now found that the binding 
specificities of these two molecules differ. It had been proposed that sGP may serve 

20 as a decoy to prevent recognition of GP, possibly to temporarily inhibit virus binding 
to target cells. The studies set forth herein suggest that this hypothesis is unlikely to 
be correct. The binding specificities of these proteins differ, and despite the fact that 
they are derived from the same viral gene, it has been surprisingly found that 
alternative forms of the glycoprotein have been selected for different functions. 

25 Although these proteins share identical amino terminal sequences, their 

carboxyl terminal regions differ. Sanchez, A. et al., Virus Res. 29:215 (1993). These 
sequences are likely responsible for the differences in binding specificity, either 
through direct interactions in these domains or by their effect on multimerization. The 
secreted glycoprotein binds to neutrophils to prevent early events in activation, 

30 possibly serving to diminish any inflammatory responses which might provide innate 
immunity to the virus, facilitating productive viral replication. The subsequent increase 
in GP synthesis gives rise to virus which in turn could infect other cells. Filoviruses 
have been shown previously to infect and replicate in different cell types and appear 
to grow readily in endothelial cells in vivo. Peters, C J. et al., Fields Virology, B.N. 

35 Fields, D.M. Knipe and P.M. Howley, Eds. (Lippincott-Raven, Philadelphia) (1996); 
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Schnittler, H.J. et al., J. Clin. Invest. 91:1301 (1993). The findings set forth herein 
suggest that its tropism for this cell type is probably determined by the specificity of 
GP. In Ebola infection, preferential binding and infection of microvascular endothelial 
cells may lead ultimately to a loss of capillary integrity that results in the severe 
5 hemorrhage observed in the terminal stages of this disease. The differential binding 
of these two gene products from the same viral structural gene generated by RNA 
editing suggests that they have evolved functionally to differentially affect immunity 
and infectivity. The ability to facilitate viral replication and target the virus to 
endothelial cells by alternative products of the same viral gene represents an efficient 
1 0 genetic mechanism which can account for different pathologic features of this disease. 
Inhibition of sGP binding to neutrophils and GP to endothelium is likely to ameliorate 
the effects of acute Ebola virus infection. 

SPECIFIC EXAMPLE 2 

I. Methods 

15 Production of pseudotyped MuLV retroviruses expressing green 

fluorescent protein (GFP): 50% - 70% confluent 293 T cells in 60mm tissue culture 
dishes were transfected using the calcium phosphate method and the following 
plasmids: 0.3 //g 1012 GP(Z) (see Figure 8) or 1012 sGp-Gp(Z) (see Figure 9), 3/jg 
LZR-gfp, 2 fig pNGVL-gag-pol. After overnight transfection, fresh media was added 

20 to cells. Twenty hours later, the supernatants were harvested and filtered through 
a .45 /jm filter. 

Infection of HUVEC cells using the pseudotyped retroviruses: The day 

before infection, 30% - 50% confluent HUVEC cells were prepared in 6-well plates. 

1 ml of pseudotyped retroviral supernatant was added to one well of the 6-well plates 
25 with 15 //g/ml of polybrene. Sixteen hours later, the viruses were removed and 

normal media was added. After 24 hours, the cells were lifted and GFP expression 

measured using FACS analysis. 

Construction of 1012sGP-GP(Z): 101 2 sGP(Z) cells were digested with Pstl 

and treated with Klenow, then digested with Xbal. 1012 GP(Z) cells were digested 
30 by EcoRI and treated with Klenow, then digested with Kpnl. Pstl/Klenow/Xbal treated 

sGP fragment and EcoRI/Klenow/Kpnl treated GP fragment were then cloned into 

Xbal/Kpnl treated pVR-1012 plasmid. 

GP and sGP derivatives: The receptor recognition domain, mucin-like 

domain and/or TM domain of GP and sGP were mutated. The mutated GP and sGP 
35 was then tested for its ability to pseudotype and for cytotoxicity in producer cells. 
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II. Results 

To determine the efficacy of targeting endothelium with the gene transfer 
vectors pseudotyped with GP of the present invention, HUVEC cells were infected 
with GP(Z) pseudotyped MuLV retrovirus (LZR-gfp) and sGP-GP(Z) pseudotyped 
MuLV retrovirus (LZR-gfp). Figures 5A-5C show the infection rate (GFP expression) 
measured using FACS analysis. As shown in Figure 5B, the GP(Z) pseudotyped 
MuLV retrovirus (LZR-gfp) was effective in targeting and expressing GFP in 
endothelium. 

To determine whether mutating GP would effect its ability to pseudotype and/or 
decrease toxicity in producer cells, the receptor recognition domain, mucin-like 
domain and/or TM domain were mutated. Figure 10 shows the results. The optimal 
envelope is able to pseudotype but shows minimal toxicity. 

The foregoing discussion discloses and describes merely exemplary 
embodiments of the present invention. One skilled in the art will readily recognize 
from such discussion, and from the accompanying drawings and claims, that various 
changes, modifications and variations can be made therein without departing from the 
spirit and scope of the invention as defined in the following claims. 

All patents and other references cited herein are incorporated by reference as 
if fully set forth. 
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WE CLAIM: 

1. A genetic construct comprising a gene operativeiy-linked to a carrier, 
wherein the carrier is associated with a transmembrane form of viral glycoprotein or 
derivative thereof. 

5 2. The genetic construct of Claim 1 , wherein the transmembrane form of 

viral glycoprotein or derivative thereof is expressed on the surface of the carrier. 

3. The genetic construct of Claim 1 , wherein the transmembrane form of 
viral glycoprotein or derivative thereof is from Ebola. 

4. The genetic construct of Claim 1 , wherein the carrier is a viral vector. 



10 



5. The genetic construct of Claim 1 , wherein the carrier is a non-biologic 
gene targeting vehicle. 

6. The genetic construct of Claim 4, wherein the viral vector is a retroviral 

vector. 



7. The genetic construct of Claim 4, wherein the viral vector is a lentiviral 

1 5 vector. 

8. The genetic construct of Claim 5, wherein the non-biologic gene 
targeting vehicle is a liposome. 

9. The genetic construct of Claim 5, wherein the non-biologic gene 
targeting vehicle is a DNA-protein complex. 

20 10. A method of targeting a gene to a cell comprising the step of 

administering to a cell population a genetic construct comprising the gene operativeiy- 
linked to a carrier, wherein the carrier is associated with a transmembrane form of 
viral glycoprotein or derivatives thereof. 



25 



11. The method of Claim 10, wherein the transmembrane form of viral 
glycoprotein or derivative thereof is expressed on the surface of the carrier. 
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12. The method of Claim 10, wherein the transmembrane form of viral 
glycoprotein or derivative thereof is from Ebola. 

13. The method of Claim 10, wherein the carrier is a viral vector. 

14. The method of Claim 10, wherein the step of administration is ex vivo. 
5 15. The method of Claim 10, wherein the step of administration is in vivo. 

16. The method of Claim 10, wherein the cell is an endothelial cell. 

17. The method of Claim 10, wherein the cell is a hepatocyte. 

18. The method of Claim 10, wherein the cell is a monocyte. 

19. The method of Claim 10, wherein the cell is a dendritic cell. 

10 20* The method of Claim 14, further comprising the step of introducing the 

cell population to a subject. 
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SEQUENCE LISTING ID NO: 1 

pVR lGia-G?CICJ 
General Description 

SKA pVR l0l2*G?(rO 
Local object 

Created: 09/14/95 04:17PK 
Last Modification Date* ? (ro data) 
lengths 7003 bp 
storage type; Basic 
form Circular 
Cnrmata 

Restriction Map 

BgHI:1sfte ^ 

CM: lite 

Dralll: 1 site g*Sgg 
EcoRV: 1 site ^xtotc 
HindUfclsite Hggr 
Hpafclsite «3^| 
Kasl:1site ggggg 
Kpnl:1srte gg*g§ 
Mart 1 site |§gggg 
PmH: 1 site g* 
Pstklsite g-gg 

Puut- 1 site CGATCfi 

rvui. i sue gai^ 
Sacll: 1 site §§3^1 
Sail: 1 site g^cc^c 
Kimfclslli S 
EcoRl:2sites 
Ncol:2sItes gg*££ 
Ndel:2sites gg*£g 
Spht:2sites 
Xhol:2sites ggg»| 
BamHI:3sites g££gg 
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Bell: 3 sites *ggA 

Functional Map 

CDS (4 signals) 

CMV IE 5* UT 

Start: BBS Endx 1123 
CMV IE INT 

Start! 1130 End? 1840 

TbGH 

Start i 4020 Sndt 4572 

Kanr 

Start: 6068 £ndx GS90 (Complementary) 
Miscjfeature (2 signals) 
CMV enhancer 

Stares 248 Endt 885 

GP(IC) 

Starts 1870 Endt 4019 

Annotations 
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1 


CCCCGCGTTT CCCTCATGAC GGTGAAAACC TCTGACACAT CCACCTCCCO 
AGCGCGCAAA GCGACTACTC CCACTTTTGG AfiACTCTGCA CGTCGACGCC 




51 


G&GACCOTCA CAGCTTGTCT GTAA6CG5AT CCCCGOAGCA GACAAGCCCC 

CTCTGCCAGT ctccaacaga gattcgccta cgoccctcgt ctcttcgggc 




101 


TCACS3CGCG TCAGCCGGTG TTGGCGGGTG TCCGGGCTGG CTTAACTATG 
ACTCCCGCGC AGTCGCCCAC AACCGCCCAC AGCCCCGACC GAATTGATAC 




151 


Kdel 

CGGCATCAGA GCACXTTCTA CTGAGAGTCC ACCATATGCG G7CTCAAATA 
CCCGTAGJC? CCTCTAACAX CACTCCCACG TGGTATACGC CACACTTTAT 




201 


CC6CACAGAT CCGTAAGGAG AAAATACCGC ATCAGATTCC CTAT7GGCCA 
GGCCTCTCTA CGCATTCCXC TTTTATGGCG TAGTCTAACC GATAACCGGT 





251 7TGCA2RCG7 TCTATCCATA TCATAATATG TACATT7ATA TTGCCTCA7G 
AACGTATGCA ACATAGGTAT ACTATTATAC ATGTAAATAT AACCGAGTAC 



301 TCCAACATTA CCGCCATCTT GACATTGAT? ATTGACTAG7 TATTAATAGT 
AGG77CTAAT CGCGG7ACAA CCGTAACTAA TAACCGATCA ATAATTATCA 



351 AATCAATTAC GGGG7CATTA CTTCACAGCC CATATAWGA G7TCCGC8TT 
g T7AC7TAATG CCCCAGTAAT CAAG7A7CGG GTATATACCT CAAGGCCCAA 

:ff 401 ACATAACTTA CGGTAAAXCC CCCGCCTGGC TGACCGCCCA ACGACCCCCC 
CGTATTGAAT GCCATTTACC GGGCGGACCC ACTGGCGGGC 7GCTCCCGCC 

QiSl CCCATTCACC TGAATAA2GA CGTATGTTCC CATACTAACG CCAATAGGGA 
GGGTAACTCC AGtTATTACT GCATACAAGG GTATCATTGC GGTTATCCC? 

^ **••»***•**-•*-»*•••••••-**--*•••••*•*»-..•.».,..... 

IISOl CTTTCCAOTG ACGTCAATCC CTGGA3TATT TACGGTAAAC TGCCCACTTG 
GAAAGGTAAC MCAGTTACC CACCTCATAA ATGCCATTTC ACGGGTGAAC 



Jl 551 CCAGTACATC AAGTGTATCA 7ACCCCAAGT ACGCCCCCTA TTGACGTCAA 
CGTCACCTAO TTCACATACT A7ACGGTTCA TCCCGGGGAT AACTCCACTT 



^ 601 7GACGGTAAA TGGCCCGCCT GGCATTA7GC CCACTACA7G ACCTTATGGG 
*~ ACTGCCATTT ACCGGGCGGA CCGTAATACG GGTCATGTAC TCCAATACCC 



Ncol 

$51 ACTTXCCTAC TTGGCAGTAC ATCTACOTAT TAC2CATCGC TATTACCATC 
TGAAAGGATC AACCGTCATG TACATGCATA ATCAG7AGCC ATAATGGTAC 



Kcol 



701 GCGATGCGCT ST7GGCAG2A CATCAA7GGG CGTGGATACC GGTTTGACTC 
CAC7ACGCCA AAACCC7CAT GTAGTTACCC CCACCTATCG CCAAACTCAC 



751 ACGGCSATT? CCAAGTCTCC ACCCCATTGA CGTCAATCGG AGTTTGTTTT 
*CCCCCTAAA CG7TCAGAGG TGGCGTAACT GCAGTTACCC 7CAAACAAAA 



801 GGCACCAAAA CCAACGGGAC TTTCCAAAAT CTCGTAACAA CTCCGCCCCA 
CCCTCCTTT? AGTTGCCCTC AAAGGTTTTA CAGCATTGTT GAGGCGGGGT 
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•51 TTGACGCAAA TCGGCGGTAG GCCtGTACGG TCGGAGGTCT ATXTAXGCAG 
AACTGCGTTT ACCCGCCATC CGCACATGCC ACCCTCCAGA TATATTCGTC 



SOI AGCTCGTTTA CTGAACCGTC AGATCGCCTC CAGACGCCA? CCACGCTCTT 
TCGA3CAAAT CACTTGGCAG fCTAGCGCAC CTCTGCGGTA GGTGCGACAA 



SacXX 



*51 TTGACCTCCA TAGAAGACAC CGGGACCGAT CCAGCCTCCO CCGCCGGGAA 
AACTGGAG37 ATCTTCTGTG GCCCTGGCTA GG7CGGAGGC CCCGGCCCTT 



1001 CGGTOCAWG GAACGCGGAT TCCCCC7CCC AAGAGTGACG TAAGTACCGC 
CCCACGTAAC CTTGCOCCTA AGGGGCACGG TTCTCACTCC ATTCATGGCG 



SphX 



1051 CTATAGACTC TATAGOCACA CCCCTTTGGC 7CTTATCCAT GCTATAMCT 
GATAtCTGAG ATATCCGXGT GGGGAAACCG AGAATACGTA CGATATGACA 



1101 TTT7GGCTTG GGGCCTATAC ACCCCCGCTT CCTTATGCTA TAGGTGATGG 
AAAACCGAAC CCCGGATATC TCCGGGCGAA CGAATACCA7 ATCCACTACC 



1151 CATAGCTTAG CCTATAGGTG TGGGTTATTG ACCAtTATTG ACCACTCCCC 
ATA7CGAATC GGA?A?CCA£ ACCCAATAAC TGCTAATAAC TGGTGXGGGG 



f^201 TATTGGTGAC CATACT7TCC AtTTACTAATC CATAACATCG CXTTTGCCA 
ATAACCACTG C7ATGAAACG TAA7GATTAG CTATTGTACC GAGAAACGG? 



^1251 CAACTATCCC TATTGGCTAT ATGCCAATAC TCTGTCCTXC AGAGACTGAC 
O CTTCATXGAG A7AACCGATA TACG5TTATG AGACAGGAAG 7CTCTGACTG 



=-1301 ACGGXCTCTG TATTTTTACA GGATGGGGTC CCXTTTATTA TWACAAATT 
TGCCTGAGAC ATAAAAA7GT CCTACCCCAG GGTAAATAAT AAATGTTTAA 



&351 CACATATACA ACAACGCCGT CCCCCGXGCC CGCAGTTTTT ATTAAACATA 
CTCTATATGT TGTTCCGGCA GGGGCCACGG GCGTCAAAAA TAATOTGTAT 



nlMOl GCGTCGGATC TCCACGCGAA CC?CGGCTAC GTGTTCCGGA CATGGGCTC? 
f ; CGCACCCTAG AGGTGCGCTT XGAGCCCATG CACAAGGCCT G7ACCCGACA 



41*51 1CTCCGGTAG CGGCGGAGCT TCCXCATCCG ACCCCTGGTC CCATGCCTCC 
O AGAGGCCATC GCCGCCtCCX AGGTCTACGC TCGGGACCAG CGTACGGAGG 



' 1501 AGCGGCTCAT GGTCGCTCGG CAGCTCCTTG CTCCTAACAG TGGAGGCCAG 
TCCCCGAG7A CCAGCGAGCC GTCGAG3AAC GAGGATTGXC ACCTCCGGTC 



1S51 ACTTAGGCAC AGCACAATGC CCACCACCAC CAGTCTCCCG CACAAGGCCO 
TCAATCCGTC TCGTGTTACG GGTGGTGGTG CTCACACGGC GTCTTCCGGC 



1601 TGGCGGTAGG CTATGTGTCT GXXXATGAGC CTGGAGATTG GGCTCGCACG 
ACCGCCATCC CATACACAGA CTTXTACTCG CACCTCTAAC CCGXGCGTGC 



16S1 CCTGACGCAG ATGGAAGACT TXAGGCAGCG GCAGAAGAAG ATGCACGCAG 
CGACTCCCTC TACCTTCTGA ATTCCCTCCC CGTCTTCTTC TACGTCCGTC 



1701 CCGAGTTCTT GTATTCTGXX AAGAGTCAGA CGTAACTCCC CTTGCGGTGC 
GACTCAACAA CATAAGACCA TTCTCACTCT CCAT7CAGGG CAACGCCACG 
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1751 


TCTTAACGGT GCACCGCAGT GTAGTCTGAG CAGTACTCG7 TGCTGCCCCC 
ACAATTGCCA CCTCCC67CA CATCXGACTC CTCATGAGCA ACCACCGCGC 






Mcol 




1801 


CGCGCCACCA GACATAATAG CTGACAGACT AACAGACTGT TCCCTTCCAT 
GCCCCCTGGT CTCTATTATC GACTCTCTGA TTGTCTGACA AGGAAAGCTA 






Sail 






Heol P«tl Pmll Bell EcoKV 




1951 


CGGTCTTTTC TCCAG7CACC GTCGTCGACA CGTGTGATCA GATATCGCGG 
CCCAGAAAAG ACClCAGTGG CAGCAGCTGT GCACACTAGT CCATAGCGCC 






ECORI 




1501 


CCGCCCGGCC GCTCTAGAAT TCTCTAATCA CACTCATCAT G3GAGCGTCA 
CCCGCGCCOC CGAGATCTTA ACAGATTAGT CTCACTAGTA CCCTCCCAGT 




1951 


GCGA3TCTGC AAT7GCCCCG TGACCGCTTC AGCAAAACAT CTTTCWTGT 
CCC7AAGACG TTAACCCGGC AC7CGCGAAG TCCTTT7GTA GAAAGAAACA 






STCGGTAATA ATCCTATTCC ATAAAGTCTT TTCAATCCCG TTCGGGGTTG 
AACCCATTAT TAGGATAAGG TATTTCACAA AAGTTAGGCC AACCCCCAAC 




S°51 


TACACAACAA TACCCTACAA GTGAGTGATA MGACAAGTT TCTGTGCCGA 
ATGTGMCTT A7GGGATCTT CACTCACTAT AACTG7TCAA ACACACGGC? 




-|101 


CACAAACTCT CTTCAACTAG CCAATTGAAC TCAGTCGGGT TCAACTTGGA 
CTGTTTCAGA GAACTTGA7C GGTTAACTTC AGTCAGCCCA ACTTGAACCT 




^2151 


GGGCAATGGA CTAGCAACTG ATC7ACCAAC GGCAACCAAA AGATCCCGTT 
CCCGTTACCT CATCGTTGAC TACATCGTTC CCGTTGCTW TCTACCCCAA 




0f2Gl 


TTCGAGCXGG TGTTCCACCA AAGGTGGTAA ATTACGAAGC CGGAGAA7GG 
AAGCTCGACC ACAAGGTGGT TTCCACCATT 7AATGCTTCG ACCTCTTACC 




1*251 


CCTGAGAACT GTTATAACCT GGCTATAAAG AAAG7TGATG GTAGTGAGTG 
CGACCCTTGA CAATATTGGA CCGATATTTC TTTCAACTAC CATCACTCAC 




5 2301 


CCTACCAGAA GCCCCTGAGG GAGTGAGGGA TTTOCCCCG? TGCCGCTATG 
GGATCGTCTT CGGGGACTCC CTCACTCCCT AAAAGGGGGA ACGGCGATAC 




2351 


TACACAAAGT CTCAGGAACT GGACCATCCC CAGGAGGACT CGCCTTTCAC 
ATGTGTTTCA GAGTCCTTCA CCTGGTACGG CTCCTCCTOA GCGGAAAOTO 




2401 


AAAGAAGGAG CCTTCCTCCT GTATGACCGA CTCGCATCAA CAATCATTTA 
TXTCTTCCTC GGAACAAGGA CATACTGGCT GAGCCTACTT GTTAGTAAAT 




2451 


TCGGGGTACA ACCCTTCCCG AAGGAGTTAT TOCATTTCTG A2CTTGCCTA 
AGCCCCATGT TGOAAACGGC TTCCTCAATA ACGTAAAGAC TAGAACGGAT 





2501 AGGCGCGAXA GGATTTTITC CAGTCTCCTC CATTOCATGA GCCTGCCAAC 
TCCCCGCTTT CCTAAAAAAG GTCACACGAG GTAACGTACT CGGACGCT7G 
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BaaTXX 



2551 ATGACCACGG ATCCCTCCAG XXACXATCAC ACGACAACAA XAAACTACGT 
TACXGCTGCC XAGGCACCTC AAXGATAGTG TGCTCCTGTO AXTTGATGCA 



2ff01 GGXTCAXAAT TTTGGAACCA ACACCACAGA OrriC T CXTi: CAAGTCGATC 
CCAACXAXTA AAACCTXGGT TGXGGXGTCT CAAAGACAAG GXXCAGCTAG 



Xkol 



2 €51 AXTTGACGXX TGTGCAGCXC SACGCAAGAT TCACACCACA ATTCCXTGTC 
TAAACTCCAT ACACGXCGAG CTCCGTTCXA AGTCTCGTGT TAAGGAACAG 



2701 CXCCXAAAXG AAACCAXCXA CTCXCATAAC CGCAGAAGTA ACACX&CAGG 
GAGGAXTTAC TTTGGXACAT CAGACTAXXC CCCTCTTCAT SGXGXXCXCC 



2751 AAAACTAASC TGGAAAATAA ATCCCACXCX TG&TACCAGC ATGGCXCACT 
XTTTGAT7AG ACCTTTTATT XAGGGTCACA ACTATGGXCG TXCCCACXCA 



2801 CCGCTTTCTG G GAAAA TAAA AAAACXCCAC AAAAAOCCXT TCAAGTGAAG 
CCCGAAAGAC CCTTTTATTT TTTTCAAGTG mVf OCGAA AGTTCACXXC 



2851 AGTTGXCTTX CGTACCTGXA CCAGAAACCC AGAACCAGGT CCTTCACACG 
TCAACAGAAA CCATGGACAT GCTCTTTGGG TCTTGCTCCA GGAACTGXGC 



M25Q1 ACASCGACGC XCTCTCCXCC CATCTCCGCC CACAACCACG CAGGCGAAGA 
yy TGTCGCTCCC AGAGAGGAGG GTAGAGGCGG G VCTTGGgGC GTCCGCWC7 



f^2S51 CCA CAAA GAA TXGGXTTCAG AGGATTCCAC TCCAGTGGT7 CAGATGCAAA 
SS CGTGXXTCTT AACCAAAGXC XCCTAAGGTG AGGTCACCAA GTCTACGTTT 



^13001 ACATCAAGGG AAAGGACACA ATGCCAACCA CAGTGACGGG TGTACCAACA 
y 1 TCTACTTCCC XTTCCTGTG? TACGGTTGGT GTCACTGCCC ACATGGTTGT 



Bell 

33051 ACCACACCCT CTCCATtTCC AATCAATGCT CCCAACACTG ATCATACGAA 
XGCTCTGGGA GAGGTAAAGG X7AGXXACCA GCGTTGTGAC TACTATGGTT 



3101 ATCATTTATC CGCCTGGAGG GGCCCCAAOA AGACCACAGC ACCACACAGC 
;t XAGXAAAXAG CCGGACC7CC CCGGGGTTCT TCTGGTGTCG TGGTGTGTCG 



^ 31S1 CTCCCAAGAC CACCAGCCAA CCAACCAACA GCACAGAATC GACGACACTA 
GACGGTXCM GTGGTCGGTT GGTTG G XX GT CCTGTCXXAG CXCCTGTGAT 



3201 AACCGAACAX CAGAGCCCXC CAGTAGAGGC ACGGOACCAT CCACCCCCAC 
TXCCGXTGXA CTCTCGGGAG GTCATCICCG TGCCCTCGTA GGXCGGGGXC 



3251 GGTCCCCAAC ACCACAGAAA CCCACGCCGA ACTTGGCAAG ACAACCCCAA 
CCAGGGGXXG XGGTGTCTXT CGGTOCGOCT XXJAACCCXTC TGXXGGGGTT 



3301 CCACACXCCC AGAACAGCAC ACTCCCGCCA GTGCCATXCC AAGAGGCGTG 
GGXGTGA6GG TCTTCXCOTG TOACCGCCGT CACGGXAACG TTCTCGGCAC 



3351 CACCCCGACG AACTCAGTGG ACCTGGCXTC CTGACGAACA CAAXACCGGG 
GXGGGGCtCC XTGAGXCACC XGGACCGAAG CACTGCXXGX CTTAMCCCC 
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BasfiX 



3401 GGTGACAAAT CTCCTGACAG GATCCAGAAG AAAGCGAAGG CATCTCACTC 
CCACTCTWA GAGCACTCTC CTAGOTCTTC TTTCGCTTCC CTACAGTGAG 



3451 CCAATACACA ACCCAAATGC AACCCAAACC TGCACTATTG CACACCCTTG 
CCTTATQTCT TGGCTTTACG TTGGGTTTGC ACCTCATAAC CTGTCOGAAC 



3501 QA7GAOOGTO CTOCCATAGG TTTAGCCTGG ATACCATACT TCGGGCCAGC 
C2ACTCCCAC GACGOTATCC AAATCCGACC TATGGTATGA AGCCCGGTCC 



3551 AGCTGAGGGA AW TACAC TQ AAGGCATAAT CCACAA2CAA AATGGAT7GA 
TCGACTCCCT TAAATGTGAC TTCCCTATTA CCTCTTAGTT WACCTAACT 



1601 GAGGCAGCTG GCCAACGAAA CGACACAAGC TCTTCAATTC 

A&ACACCTAA CTCCGTCGAC CCCTTCCTTT GCTGTGT7CG AGAACTTAAC 



3651 TTCTTAAGGC CAACTAC?GA GTTGCCTACA TTCTCTATAC TAAATCCCAA 
AXGAA3TCCC GTTGATGACT CAACGCATGT AAGAGATATC ATTTAGCCTT 



3701 AGCAATAGAC T TCTTGCT C C AAASATGGGG AGGAACATGT CACATTCTAG 
TCCTTATCTG AAGAACGACG TTTCTACCCC TCCTTGTACA GTGTAAGATC 



1751 GGCCTCATTG TTGCATTGAA CCCCAAGATT GGACCAAAAA TATCACTGAT 
CCCGACTAAC AACCTAACTT GGGGTTCTAA CCTCSTTT2T ATAGTGACTA 



Bell 



3801 AAAATCGATC AAATAATCCA TGACTTTCTC GATAATAATC TTCCAAATCA 
% TTWAACTAG TTTATTAGCT ACTGAAACAG CTATTATTAC AAGGTTTACT 



3851 GAATCATCGC AGCAACTGGT CGACTGGATG GAAACAATGG GTTCCTGC2G 
Z J~ CTTACTACCG TCGTTGACCA CCTGACCTAC CCCTGTTACC CAAGGACGAC 



39Q1 CAATAjGGAAT cacacgagta atcattccta toatttgctct gctctccatt 

C7TATCCTTA GTGTCCTCAT TAGTAACGAT AA7AACGAAA CGACACGTAA 



EcoRI 



3951 TCCAAATTCA TCCTTTGAAC 7AATATAGCA 1CATACTTTA GAATTCTAGA 
7t ACGCTTAACT ACGAAACTXG AWAffATCGT AGTATCAAAT C?TAAGATCT 



Karl 



Xa*I BafflHI Bglll 



4001 CCAGGCOCCT GGATCCAGAT CTGCTGTGCC TTCTACTTCC CAGCCA2CTG 
GOTCCGCGGA CCTAGCTCTA GACGACACGG AACATCAACG GTCCGTAGAC 



4051 TTG7TTGCCC CTCCCCCOTG CCTTCCWCA CCCTGG3AGG tGCCACTCCC 
AACAAACGGG GAGOOGGCAC GCAAGGAACT GGGACCTXCC ACGGTGAGGG 



4101 ACTCTCCTTT CCTAATAAAA TCAGGAAATT GCATCGCATT CTCTGAQTAG 
TCACAGGAAA GGATTATTTT ACTCCTTTAA CCTACCCTAA CAGACTCATC 



4151 GTGTCATTCr ATTCTGGGGG GTGGGGTGGG GCAGCACAGC AAGGGGGACG 
CACACTAAGA TAAGACCCCC CACCCCACCC CGTCGTCSCC TTCCCCCTCC 
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SphX Kpnl 



4201 ATTCSCAAGA CAATAGCAGG CATGCTCGCG ATGCGG7GGG CTCTATGGGT 
TAACCCTTCT GTTATCG7CC CTACCACCCC TACCCCACCC GAGATACCCA 



4251 ACCCACCTGC TGAACAATTG ACCCGCTTCC TCCTGGGCCA GAAACAAGCA 
TCGGTCCACG ACTTCTTAAC TCCCCCAAOG AGGACCCCCT CTTTCTTCCT 



4301 CCCACATCCC CTTCTCTGTG ACACACCCTG TCCACCCCCC TGGTTCTTAG 
CC6TOTA0GC CAAGAGACAC TOTGRGGGHZ AGGTCCGGGG ACCAAGAATC 



4351 TCCCAGCCCC ACTCATAGGA CACXCATAGC TCAGGAGGGC TCCGCCTXA 
AAGCTCCGCG TGAGTATCCt CTGAGTATCG AGTCCTCCCG AGGCGGAAG? 



4401 ATCCCACCCC CTAAAGXACT TCGAGCGGTC TCTCCCTCCC TCATCAGCCC 
7AGGCTCGGC GATTTCATGA ACCTCGCCAC AGAGGGAGGG ACTACTCOGO 



4451 ACCAAACCAA ACCTAGCCTC CAAGACTGCC AAGAAATTAA ACCAAGATAG 
7GGTTTGGTT TGGATCGGAG G77CTCACCC TT CT TTAATC TCGTTCTATC 



4501 GC7ATXAA0T GCAGAGCGAG AGAAAAXCCC TCCAACATGT CACGAAGTAA 
CGATAATTCA CCTCTCCCTC TCTTTCACGG AGGTTCTACA C7CCTTCATT 



0 Xaal 

S 45Si TGASAGAAAT CATASAATTT CTTCCCCTTC CTCGCTCACT CACTCGCTGC 

01 ACXTCTTTA GTATCTTAAA CXAGGCGAAG GACCCAGTGA CTGAGCCACG 



^4401 CCffCOGTCGT TCCGCTGCGG CGAGCGGTAT CAGCTCACTC AAAGXGCTA 
r; CGAGCCAGCA AGCCGACGCC GCTCGCCATA CTCCAGTGAG TTTCCSCCAT 



4 £51 ATACGGTTAT CCACAGAATC ACGGGATAAC GCACCAAAGA ACATGTGAGC 
ffi TATGCCAATA GGTOTCTTAG TCCCCTA27G CCTCCTXTCT TG7ACACTCG 



m 4701 AAAAGGCCAG CAAAAGGCCA CGAACCGTAA AAAGGCCGCG TTGCTCCCGT 
T% T XUTC CCfiTC GTnTCCOOT CCTTGGCATT TTCCCGGCGC AACGACCGGA 



H 4751 TTTCCCATAG GCTCCGCCCC cctcacgagc atcacaaaaa tcgacgctca 
AAAAGCTATC CGAGGCGGGG GGACTGCTCG TACTGTITTT AGCTGCGAGT 
.* * . . . 

""4801 AGTCAGAGGT CCCGAAACCC GACAGGACTA TAAACATACC AGGCGTTTCC 
TCAGXCTCCA CCGCTTTGGC CTGTCCTGAT ATTTCTAXGG CCCCCAXAGG 



4SS1 CCCTGGAAGC TCCCTCGTGC CCTC5CCTGT 1CCGACCCTG CCCCTTACCO 
GGGACCnCG AGG&ACCACG CGACACCACA AjGGCTGGCJUC CGCGAATGCC 



4901 CATACCTGTC CGCCTTTCTC CCTTCGGGAA GCCTGGCGCT TTCTCAA7CSC 
CTATGGACAG GCG&AAAGAG GGAA6CCCT? CGCACCCCGA AAGAGTTACC 



4951 7CACGCTGTA CGTXTCTCXG TTCGGTGTAG CTCOTTCGCT CCAXCCTGQG 
AGTGCCACAT CCATAGACTC AAGCCACATC CAGCAAGCCA GGTTCGACCC 



5001 CTGTCTGCAC GAACCCCCCG TTCKGCCCGK CCGCTGCCCC TTATCCGGTA 
CACACACGTG CTTGOGGGGC AACTCGGGCT CGCGACGCGG AXTAGGCCAT 
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3031 ACXATCCTCT TGAOTCCAAC CCGGTAAOAC ACGACTTATC CCCACTGGCA 
TCAEAGCAGA ACTCAGOTTG OGCCATTCT O TGCTGAAXAC CGOTOACCGT 



5201 GCASCCACTG GTAACAGGAT TAGCACACCO AGGTATGTAG CCOGTOCTAC 

CCTCGGTGAjC cattgtccta atc c tct c gc tccatacatc cgccaccatg 



5151 AGAGJTCTXG AACTGGTGGC CTAACTACGG CTACACTACA aggacagtat 
TC7CAACAAC TTCACCACCC GATTGATGCC CATC7GATCT TCCTCTCATA 



5201 TTOGTATCTC CGCTCTOCTC AAQCCAGTTA CCTTCGGAAA AAGACTTCOT 
AACCATAGAC CCCACACGAC TTCGGMAAT GGAAGCCT7T TTCTCAACCA 



5251 AGCXCWGAT CCGG CAAAC A AACCACCGCT CGTAGCGGTG C1TTTTTTQT 
TCGAGAACXA CCCCCCTTGT TTC6TQ6CGA CCATCCCCAC CAAAAAAACA 



5301 TT CCAAG CAG CACATTACGC GCACAAAAAA AGOATCTCaA CAAfiATCCTT 
AACCTTCCTC CTCTAATGCC CCTCTTTTTT TCCTAGAGTT CTTCTAGGAA 



5351 TCA2CTTTTC TACGGGGTCT GACCCTCACT GGAACGAAAA CTCACGT7AA 
ACZAGAAAAG ATGCCCCACA CTGCGAGTCA CCTTGCT7TT GAGTGCAXTT 



5401 GGGArrXTGG TCATGAGATT ATCAAAAACC ATCTTCACCT AGATCCTTTT 
CCCTAAAACC ACTACTCTAA 7ACTTTTTCC TAGAAGTCGA TCTAGGAAAA 



f-5451 AAA7TAAAAA TCAAGTTTTA AATCAATCTA AACTATATAT CAGTAAACTT 
;;;; TTXAATTTTT ACTTCAAAAT TTAGTTAGAT T7CATATATA CTCATCTCAA 



H.55C1 CGTCTGACAC CTACCAATGC TTAATCAG7G AGGCACCTAT CTCAGCCATC 
Lj CCA5ACTCTC AATGGTTACG AATTAGTCAC TCCG7GSATA CACTCGC^AG 



\5551 ttZXTATTTC GPTCATCCAT AGTTGCCTGA CTCCGGGGGG GGGGGCCGCT 
m ACAGATAAAG CAAGTAGCTA TCAACGGACT CAGGCCCCCC CCCCCCGCGA 

y S601 GM3G7CTGCC TCG7CAAGAA GGTGTTGCTC ACTCATACCA CGCCTGAATC 
CTCCAGACGG ACCACCTCTT CCACAACGAC TGACTATGGT CCGGACTTAG 



fi5€5l CCCCCATCAT CCAGCCAGAA AGTGAGGGAG CCACCGTTGA TGAGAGCTTT 
CGGGGTACTA GGTCGCTCTT TCACTCCCCC GGTGCCAACT ACTCTCGAAA 

f 5701 GTTGTACCTG GACCAGTTCG TGATTTTGAA ll'mucm 1 GCCACGGAAC 
□ CAACATCCAC CTCCTCAACC AC T AAAA C TT GAAAACCAAA CGGTGCCTTG 



5751 GGTCTGCGTT CTCCGGAAGA 7GCCTGATCT GATCCTTCAA CTCAGCAAAA 
CCAGACGCAA CAGCCCTTCT ACGCACTAGA CTACGAACTT GAGTC C TTTT 



5801 GTTCGATTTA TTCAACaAAG CCGCCGTCCC CTCAAOTCAG CCTAASGCTC 
CAACCTAAAT AAG7TG777C GGCGGCAGGG CAGTTCACTC GCATTACGAG 



5051 TGCCACTCTT ACAACCAATT AACCAATTCX CAWAGAAAA ACTCATCCAC 
ACGGTCACAA TGTTGOTTAA TTOGTCAAGA C7AATCTTTT TGAGTAGCTC 



5S01 C&TCAAATGA AACTGCAATT TATTCATATC AGGATTATCA ATACCATATT 
GTACTTTACT TTGACCMAA ATAAG7ATAC TCCTAAIAGT TATCCTATAA 



£951 77TCAAAAAG CC G TTT C TCT AATGaAGGAG AAAACTCACC GAGGCAGTTC 
AAACTTTTTC GGCAAAGACA 77ACTTCCTC CTTTGAGTGG CTCCGTCAAG 
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cooi cxiafimwc caagatccto ctasoggtct gcgattccca ctcgtccaac 
craaecxACc gttctacgac catagccaga cgctaaggc? gagcagcttg 



€051 ATCAATACAA CCTATTAATT TCCCCTCOTC AAAAATAACG WATCAACTC 
TACTTATCTr OOATAATTAA AGGGGAGCAG TSTTTAXTCC AATAGTTCAC 



Hindin 



C1C1 AflAAAS CACC ATGAGTGACG ACTCAATCCG GTGAGAATCG CAAAAGCTTA 
VLT.VACTGG TACTCACTCC TGACTTAGGC CACTCTTACC CTTTTCCAAT 



6X51 rCCATTTCTT TCCAGACTTG OTCAACA GG C CACCCATTAC GCTCGTCATC 
ACCTAAAGAA ASOTCTCAAC AAGTTCTCCC CTCGGTAATG C6ACCAGTAG 



6201 AAAATCACTC GCATCAACCA AACCGTTATT CATTCOTGAT TGCOCCTCAC 
T37TACTCA0 CGTAG7MG? TTGGCAATAA CTAAGCACTA ACCCCGACTC 



Fval 



£251 CGAGACSAAA 7ACGCGATCG CTCTCAAAAG GACAATTACA AACAGGAXTC 
GC3CTCCTTT ATGCGCTACC GACAATTTTC CTGTTAATGT TTCTCCTTAG 



6301 GAATGCAACC GGCGCAGSAA CACTGCCACC GGA2CAACAA TAT777CACC 
CTIA CC TTGG CCGCCTCCTT GTGACCCTCG CGTAGTTCTT ATAAAAGTGC 



L J 6351 TGAA2CAGGA TATTCTTCTA ATACCTGGAA TG C T G I 1V 1 C CCGGGGATCG 
AdtTACTCCT ATAAGAAGAT TATGGACCTT ACGACAAAAG GGCCCCTAGC 



6401 CAGTCCTGAG TAACCATGCA TCATCACGAG TACGGATAAA ATGCTTGATG 

crcAccAcrc amggtacgt accagtcctc atgcctattt taccaactac 



^ 6451 CTCCGAAGAG GCATAAATTC CCTCAGCCAG TTTACTCTGA CCATCTCATC 
J! CAGCCTTCTC CG7A77TAAG GCAGTCGGTC AAATCAGACT CCTAGAGTAG 



. 6501 TGTAACATCA TTGGCAACCC TACCTTTGCC ATGTTTCAGA AACAACTCTG 
ACATTCTAGT AACCCTTGCG ATGGAAACGG TACAAAGTCT TTGTTGAGAC 



Clal 



r -~ 6551 GCGCATCGGG CTTCCCATAC AATCGATAGA 7TGTCGCACC 7GATTGCCCG 
CCCGTAGCCC GAAGGGTXTG TTAGCTATCT AACAGCGIGG ACTAACGGGC 

M €601 ACATOLTCCC GAGCCCATTT ATACCCATAT AAATCAGCAT CCATGTTGGA 
TGXAASAGCG CTCCCGTAAA TATGGCTATA TTTAGSCCTA GGTACAACCT 



XfcOl 

6651 ATTTAATCGC CGCCTCGAGC AACAC G TT TC CCCTTGAATA TGGCTCATAA 
TAAA3T AGCG CCGGAGCTCG TTCTGCAAAQ GGCAACTTAT ACCGAGTATT 



6701 CACCCCTCGT ATTACTGTTT ATGTAACCAG ACAGTTTTAT TGT7 C ATCAT 
CSCCGOAACA TAATOACAAA TACATTCGTC TGTCAAAATA ACAAGTACTA 



Drain 



6751 GATATATTTT TATCTTCTCC AATGTAACAT CAGAGATTW GAGACACAAC 
CIK7ATAAAA ATMAACACG TTACATTGTA GTCTCTAAAA CTCTCTCTTG 
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6801 CTCOCTTTCC CCCCOCCCCC AT7ATTGAAG CATTTATCAG GGTTATTOTC 
CaO C O AA AGC GGGGGGGGGG TAATAACTTC GTAAATAGTC CCAATAACAC 



4S51 C C MCA flCCC ATAGfcTATTT GAATCTATTT A&XAAAATAA ACAAATAGCG 
ACTACTCGCC TATGTATAXA CTTACATAAA TCTTXTTATT TCTTTATCCC 



M01 CTTCCOCOCA catwccccc axaactccca ccwacctct aacaaaccai 

CAAGQCOCGT CTXAACCCGC T7TTCACOGT GCACTCCAflX TTCTTTQOTA 



6551 TATTATCATC ACATTAACCT A?AAAAATAO GCGTATCACG ACCCCCTWC 
AXAATAGTAC TCTAATTCGA TATTTTTATC CCCXTAflTCC TCCGGGAAAO 



7001 GTC 
CAS 
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General Description 

Sn pVR 2012*6P(S) 
Loc&l object 

Cr«atcdt 09/14/98 03:5*FK 

Last. EodlSic&tioa Date: ? <ao d«t*J 

leagtht 7073 bp 

storage types B*sia 

lora: Circular 



Restriction Map 
Ball: 1 site *g 
eel!: 1 rite Jg™ 
ClalrlsJte 

Hindfflrlsite 
Kasl:1s?te 
Kpnl:1s»e 
Narl:1she gcccc 
Pmll: 1 site 
Pvul: 1 site 

Saclklsite CCGC6G 
Sail: 1 site ™™ 
Xbatlsite J§™» 

Xmnl* 1 she GXW3NNNITC 
*mra. l sue cmnaouuus 

Ndel:2s?te$ 
EcoRV^sites 
Sphl:3 sites 
Neo.:4sftes 

BamHJ: 6 sites gggg 

Functional Map 

COS (4 signals) 
CMVIES'UT 
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Start: 886 Ends 1129 
CMVIE IWT 

Start; 1130 Ends 1840 
TbGH 

Start: 4090 Endi 4642 

Kan r 

Start r €13 8 Endi €760 CConpl errant a ry) 

Miscjeature (2 signals) 

CMV enhancer 

Startx 248 End; 885 
GP(S) 

Starti 1870 End: 4089 

Annotations 



O 
%Q 

m 

a 
a 
m 
m 
m 

a 
m 

a 
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x Mccaccrrr cggtgatcac ggtgaaaacc tctgacacat gcagctcccc 

A3CGCGCAAA GCCACTACTG CCACTTTTGG ACACTGTGTA CGTCGAGGGC 



51 GAGACCGCCA CAGCTTGTCT GTAAGCGOAT GCCCCCAGCA GACAACCCCG 
CrCMCCAST CTCGAACAGA CATTCGCCTA CGGCCCTCCT CTGTTCGGCC 



101 TCAGGGCCCG TCAGCGGGTG TTCCCGGG7G TCO30GCTGG CTTAACTATG 
AGTCCCGCGC AGTCGCCCAC XACCGCCCAC ACCCCCGACC GAATTGATAC 



NdeX 



1S1 CGCCXTCA5A CCAGATTCTA CTGAGACTGC ACCATATGCG G7GTCAAATA 
CCCGCAGTCT CGTCTAACAT GACTCTCACG TGCTATACGC CACACTTTAT 



Ball 



201 CCCCACAGAT GCGTAACCAG AAAATACCGC ATCACATTGG CTATTGGCCA 
CCCGTGTCTA CGCATTCCTC TTTTATGGCG TAGTCTAACC GATAACCGGT 



251 TTGCATACGT TCTATCCATA TCATAATATC TACATTTATA TCGGCTCA2G 
AACCTATGCA ACATAGGTAT AGTATTATAC ATGTAAATAT AACCGAGTAC 



301 CCCAACATTA CCGCCATGTT GACATTGATT ATTGACTACT TATTAATAGT 
AGGTTGTAAT CCCGGTACAA CTGTAACTAA TAACTCATCA ATAATTATCA 

"2 251 AATCXATTAC GGGG7CATTA GTTCATAGCC CATATATGGA GTTCCGCGTT 
y TTAGTTAATG CCCCACTAAT CAAC7ATCGG G7ATATACCT CAAGGCGCAA 



O 401 ACAtAACTTA CGGTAAATGG CCCGCCTGGC TGACCGCCCA ACGACCCCCG 
ri TCCA3TGAAT GCCATTTACC GGGCCGACCG ACTGGCGGGT TCCTGGGGGC 



451 CCC3GTCACG TCAACAASGA CGTATGTTCC CATAGTAACG CCAATAGGGA 

^zl GGGTAACTGC AGTTATTACT GCATACAAGG G7ATCATTGC GCTTATCCCT 

s sol CTTTCCATTG ACGTCAATGG GTGGAGTATT TACGG?AAAC TCCCCACTTG 

Pi GAAACCTAAC TCCAGTTACC CACCTCATAA ATGCCATTTG ACGGCTGAAC 



11 Kdel 

IZ 551 GCAGTACXTC AAGTGTATCA TATGCCAACT ACGCCCCCTA TTGACCTCAA 
CGTCATGTAG T7CACATAGT ATACGGTTCA TGCGGGGGAT AACTGCAGTT 



601 TGACGGTAAA TGGCCCGCCT CGCATTATGC CCACTACXTC ACCTTATCGG 
ACXGCCATTT ACCGGGCGGA CCGTAATACG GGTCATCTAC TGGAATACCC 



Ncol 

«1 ACTTTXCTAC ttggcactac atctacgtat TAGTCATCGC tatcaccatg 
TGAAACCATG AACCMCAXC TAGATGCATA ATCAGTAGCG ATAATGCTAC 



KC6Z 



101 GTGATCCGGT TTTGGCAGTA CATCAATGCG CG7GGATACC GGTTTCACTC 
CACTACGCCA AAACCGTCAT GTAGTTACCC GCACCTATCG CCAAACTGAG 



751 ACGCGGATTT CCAXGTCTCC ACCCCATTGA CGTCAATGCG AGTTTCTTTT 
TGCCCCTAAA GGTTCAGAGG TCGGGTAACT GCACTTACCC TCAAACAAAA 
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SOI CGCACCAAAA TCAACGGCAC TTTOCAAAAT GTCGTAACAA CTCCGCCCCA 
CCGTGGT7TT AGTTCCCCTO AAAGGTTTTA CAGCXTTCTT GAGGCGGGGT 



•51 TTGACGCAAA TCCOCGCTAC CCGTCTACCC TGGGAGCTCT ATA7AAGCAG 
MCTGCGTTT ACCCCCCXTC CGCACATCCC ACCOXCAGA TATAMCGTC 



501 AGCTCCTTTA GTGAACCGTC AGATCGCCTC GAGACGCCAT CCACGCTCTT 
TCGAGCAAAT CACTTCCCAG TCTAGCCGAC CTCTCCGGTA GGTGCGACAA 



SacXl 



531 TTGACCTCCA TAOAAOACAC CG3GACCGAT CCAGCCTCCG CGGCCCCGAA 
AACTGGAGGT A3CTTCT G TG GCCCTGCCTA GGTCGCAGGC GCCGCCCC7T 



1001 CGGTCCATTG GAACQCGGAT CCCCCGTGCC AAGAGTGACG TAAGTACCCC 
GCCACGTAAC CT7GCGCCTA AGGGGCACCG TTCTCACTGC ATTCATGGCG 



SphI 



1051 CTATAGACTC TATAGGCACA CCCCTTTGCC TCTTATGCAT GCTATACTCT 
GATATCTGAG ATATCCCTGT GGCGAAACCG AGAATACGXA CGATATGACA 



1101 rTTTGGCTTC GGGCCTATAC ACCCCCCCTT CCTTATCCTA TAGGTCATGG 
AAAACCGAAC CCCGGATATG TGGGCGCGAA GGAATACGAT ATCCACTACC 



"3151 TATAGCTTAG CC7A7AGG7G TGGG77ATTG ACCATTATTG ACCACtCCCC 
ATATCGAATC G G AT ATCCAC ACCCAATAAC TGGTAATAAC TGGTGAGGGG 



OL2Q1 7ATTCGTGAC GATACTTTCC ATTACTAXTC CATAACA7GG CTCTTTGCCA 
r% ATAACCACTG CTATGAAAGG TAATGACTAG GTATTCTACC GACAAACGGT 



^251 CAACTATCTC TATTGGCTAT ATGCCAA7AC TCTGTCCTTC AGAGACTCAC 
CTTGATAGAG ATAACCGATA TACGGTTATG AGACAGGAAG TCTC7GACTG 



» 1301 ACCGACTCTG TATTTTTACA GGATGGGCTC CCATTTATTA TTTACAAATT 
TGCC7GACAC ATAAAAA7G7 CC^ACCCCAG GGTAAA7AAT AAATGTTTAA 

H?iaSl CACATATACA ACAACGCCCT CCCCCCTGCC CGCACTTTOT ATTAAACATA 
H ! GTGTATATGT TGTTGCGGCA CGGGGCACGG GCGTCAAAAA TAATTTGTAT 



Pi 14 01 CCGTGGCATC TCCACGCGAA TCTCCGGTAC CTGTCCCGGA CATGGGCTCT 
CGCACCCTAG AGCTGCGCTT AOAGCCCATG CACAAGGCCT GTTACCCGACA 



1451 TCTCCGGTAG CGGCGGAGCT TCCACATCCC AGCCCTGGTC CCATGCCTCC 
AGAGGCCATC CCCGCCTCGA AGGTGTAGGC TCGCGACCAO GGTACGGAGG 



1501 ACCGGCTCAT GGTCGCTCGG CAGCTCCTTG CTCCTAACAG TGGAGGCCAC 
TCGCCGAGTA CCAGCGAGCC GTCGAGGAAC GAGGATTGTC ACCTCCGOTC 



1551 AC77AGGCAC AGCACAATCC CCACCACCAC CAOTCTGCCG CACAAGOCCG 
TCAATCCGTG TCGTGTTACG CCTGGTGGTG GTCACACGGC GTCTTCCCGC 



1601 TGGCGGTAGG CTATOTGTCT GAAAATGAGC G7GGAGATTG GGCTCGCACG 
ACCGCCATCC CATACACAGA CTTTTACTCG CACCTCTAAC CCGAGCCTGC 



1651 CC7GACGCAG ATGGAAGACT TAAGGCAGCG GCAGAAGAAG ATGCAGGCAG 
CGACTGCGTC TACCTTCTGA ATTCCGICCC CGTCTTCWC TACGTCCGTC 
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1701 CTGAGTTCTT GTATTCTGAT XACAGTCACX GGTAACTCCC GTTGCCCTGC 
GACTCAACAA CATAAGACTA TTCTCACTCT CCATTGAGGG CAACGCCACC 



1751 TCTTAACGC? GGAGGGCAGT GTACTCTGAG CAGTACTCGT TCCTGCCCCC 
ACAA2TGCCA CCTCCCGTCA CATCAGACTC GTCATCACCA ACGACCGCGC 



XcoX 

1801 CGCGCCAOCA CACATAATAG CTGACACACT AACAGAC7GT TCCTTTCCA? 
CCCCGGTGG? CTGTATTATC GACTGTCTGA TTGTCTGACA AGGAAAGCTA 



S&1I 



NeoX Stall Bell EcoRV 



2«S1 CCGTCTTTPC TGCACTCACC G7CG7CGACA CCTGTGATCA GATATCCCGG 
CCCACAAAAG ACC7CAGTGG CAGCAGCTCT GCACACTAGT C7ATAGCGCC 



SphI ECO&V 



1501 CCGCTCTAGC TACATGCATG C7CGAGCGCC CGCCAGTGTG ATGGATATCT 
GGCGAGATCG ATCTACGTAC GAGCTCCCCG GCGGTCACAC TACCTATAGA 



Ncol 



1951 GCAGAATTCT ATCTTCAGGA TCTCGCCATC GAGGGTCTTA GCCTACTCCA 
S CGTCOTAAGA TACAAGTCCT AGAGCGGTAC CTCCCASAAT CGGATGAGGT 



Mooi attgcccaga gataaatctc gaaaaagctc tttctttgtt tggctcatca 

O TAACGCCTCT CCATTOAAAG C777CTCGAG AAAGAAACAA ACCCAGTAGT 



02051 TCT7A77TCA AAAGGCCTTT TCCATGCCTT TCCGTGTTCT GACCAACAGC 
Z: AGAA7AAAGT TTTCCGGAAA AGGTACGCAA ACCCACAACA CTGGT7GTCG 



5 2101 ACT77AGAAG TAACAGAGA? 7GACCAGCTA GTCTGCAAGG ATCATCTTGC 
O TGAAATCTTC A7TGTCTCTA ACTCCTCGAT CAGACGTTCC 7AGTAGAACG 



Lx2151 ATCAACTGAC CAGC7GAAAT CAGTTGGTC7 CAACCTCGAG GGGAGCGGAG 
r Z TAGTTGACTG GTCCACTTTA CTCAACCAGA GTTGGAGCTC CCCTCGCC7C 



iCoHV 

2201 TATCTACTGA TATCCCA7C7 GCGACAAAGC GTTGGGGCT7 CAGATCTGGT 
ATAGATGAC? ATAGGCTAGA CGCTGTTTCG CAACCCCGAA G7CTACACCA 



2251 G7CCCTCCCC AAGTGG*CAG CTATGAAGCA GSAGAATCGG CTGAAAAT5G 
CACGGAGGGG TTCACCAGTC CATACT2CCT CCTCTTACCC GACTTTTAAC 



2301 CVACAATCTT GAAATAAAGA AACCGGACGG GAGCGAAXGC TTACCCCCAC 
GATG7CAGAA CTTTATTTCT TTGGCCTCCC CTCGCTTACG AATGCGGGTG 



2351 CGCCGGATGG TGTCAflAGGC WTCCAAGGT CCCGCTA7GT TCACAAAGCC 
GCCGCCTACC ACAGTCTCCG AAAGGTTCCA CGGCGATACA AGTGTTTCGO 



2401 CAAGGAACCG GGCCCTGCCC CG3TGACTA7 GCCTTTCACA AGGATGGAGC 
G77CCTTGGC CCGGGACGGG CCCACMATA CGGAAAGTGT TCCTACCTCG 



2451 TT2CTTCCTC 7ATGACAGGC 7GGC7TCAAC TGTAA7TTAC AGAGGAGTCA 
AAAGAAGGAG ATACTGTCCG ACCCAAGT7G ACATTAAAK TCTCCTCAG7 
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2501 ATCTCCCTGA CCGOGTJUWC GCATTCTTGA TATTGGCTAA ACCAAAGGAA 
TAXAACGACT COCCCATTAG CCTAAGAACT ATAACCCATT TGGTTTCCTT 

2551 ACGTTCCT7C AATCACCCCC CATTCGAOAG GCAGCAAAC7 AGAC7GAAAA 
TCCAAGGAAG TTACTGOOGG CTAAGCTCTC CGTCGTTTGA TCTCACTTTT 

2S01 9ACATGAAGT XACTATGCCA CATCCTACTT GGAGTACGAA ATCGAAAATT 
ATGTAGTTCA ATGATACCGT GTAGCATGAA CCTCATGCTT TAGCTTTTAA 



2*51 TTGGTGCTCA ACACTCCACG ACCCTTTTCA AAA7TAACAA CAATACTTTT 
AACCACCAGT TCTGAGGTCC TGGGAAAAOT TTTAATTGTT AT7ATGAAAA 



2701 CTTCTTCTCG ACACCCCCCA CACGCCTCAC TtXCTT T TCC AGCTCAATGA 
CAAGAAGACC TGTCCGGGCT GTGCCGAGTC AAGGAAAAGG TCGACTTACT 

2751 TACCATTCAA CTOCACCAAC AGTTGAGCAA CACAAC7GGG AAACTAATTT 
ATCGTAASTT GAAGTCOTTG TCAACTCGTC? GTGTTGACCC TTTGATTAAA 



2801 GGACACTAGA MCTAATATC AATGCTGATA TTGGTGAATG GGCTTTTTGG 
CCTGTGATC? ACGATTATAG TTACGACTAT AACCACTTAC CCGAAAAACC 



2851 CAAAATAAAA AAATCTCTCC GAACAACTAC CTGGAGAAGA GCTGTCTTTC 
CTTTTATTTT TTTACAGAGG CTTGTTGATC CACCTCTTCT CGACAGAAAG 

C?i01 GAAACTT7AT CGCTCAACGA GACAGAAGAC CATGATCCCA CATCG~CGAG 
CTTTCAAATA GCGAGTTGC? C7GCCTTCTG CTACTACGCT GTAGCAGCTC 

ifl^Sl AACTA CAAA G GCAAGAATCT CCGACCGGGC CACCAGCAAG TATTCGGACC 
TTGATGTT7C CCTTCTTACA GGCTGCCCCG GrGGTCCTTC ATAAGCCTGG 

%3001 TGGTTCCAAA GGATTCCCCT GGGAXGGTTT CATTGCACGT ACCAGAAGGG 
m ACCAAGGTTT CCTAAGGGGA CCCTACCAAA GTAACGTGCA TCCTCTTCCC 

3^51 CAAACAACAT TCCCGTCTCA GAATTCGACA GAAGGTCGAA GAGTACATGT 
^ C77TCTTGTA ACGGCAGAGT CTTAAGCTGT C7TCCAGCTT CTCATCTACA 

1JP101 CAATACTCAG GAAACTATCA CAGAGACAAC TGCAACAATC A7ACGCACTA 
CTTXMAGTC CTT7GATAGT GTCXCTGTTC ACGTTGTTAG TATCCGTGAT 

±3151 ACGGTAACAA CATGCAGATC TCCACCATCG GGACAGGACT GAGCTCCAGC 
Q TGCCA7TGTT GTACGTC7AG AGGTGCTAGC CCTG7CCTGA CTCGAGGTCG 



KCQl 



3201 CAAATCCTGA GTTCCTCACC CACCATCGCA CCAAGCCCTG AGACTCAGAC 
GTTTAGGACT CAAGG&CTGG CTCGTACCGT GGT7CGGCAC TCTGAGTCTG 

3251 CTCCACAACC TACACACCAA AACTACCAGT GATOACCACC CAGGAACCAA 
GACCTGTTGG ATG7GTGGTT C7GATGGTCA CTACTG3TGG CTCCtTGGTT 

3301 CAACACCaCC GAGAAACTCT CCTGGCTCAA CAACAGAAGC ACCCACTCTC 
GTWTGGTGG CTCTTTGAGA GGACCGAGTT CWCTCTTCG TGGGTGAGAfl 

3351 ACCACCCCAG ACAATATAAC AACACCGGTT AAAACTOTTT GGGCACAAGA 
TCCTGGGGTC TCTTATATTG T7G7CGCCAA TTTTGACAAA CCCGTGTTCT 
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3401 CTCCACAACC AACGGTCTAA TAACTTCAAC AGTAACACCT ATTCTTCCCA 
CAGGTGTTCO TTCCCACATT ATTGAAGTTG TCATTGTCCA TAAGJUCCC? 



3451 CCCTTGCACT TCGAAAACQC AGCAOXAOST AAGTTAACAC CAGCGCCACG 
CGGAACCTGA AGCTTTTGCG TCCTCTTCTO CTCAAXTGTG GCCCCGGTGC 



-3501 GGTAAATCCA ATCCCAACTT ACACTACTGG ACTGCACAAG AACAACATAA 
CCATTTACCT TAGGGTTGAA TCTGATGACC 7GAC676TTC TTGTTCT7ATT 



BataHX 



3551 TGCTGCTGS3 ATTGCCTGGA TCCCCTACTT TCCACCGGGT CCACAAGGCA 
ACGACGACCC TAACCGACCT AGGGCA2GAA ACCTGGCCCA CGTCTTCCG? 



3*01 TA2ACACTCA AGGCCTTATG CACAACCAAA ATCCCTTAOT CTCTCGACTC 
ATATCKACT TCCGCAATAC GTCTTGGTTT TACGGAATCA CACACCTGAG 



3651 AGACAACTTG CAAATCAAAC AACTCAAGCT CTCCAGCTCT TC~AAGGGC 
TCTGTTGAAC GTTTACTT7G TTGAGTTCGA CACGICCAAA AGAATTCCCG 



3701 CACGACCCAG CTGCCGACAT ATACCATACT CAATAGGAAG CCCATAGAT? 
GTGCMCCTC GACGCCTGTA TACGGTATCA GTTATCCTTC CGGTA7CTAA 



BaraHI 



SfSl TCCTTCTCCC ACGA7GGGCC GGGACATCTA G3ATCCTGGG ACCAGATTGT 
ffl AGGAAGACCC TGCTACCCCG CCCTCTAGAT CCTAGGACCC TCGTCTAACA 



5101 7CCATTGACC CACATCATTC GACCAAAAAC ATCACTGATA AAATCAACCA 
H ACGTAACTCG GTGTACTAAC CTGGTITTTG TAGTGACTAT TTTAGTTCGT? 



il51 AATCXTCCAT GA77TCATCG ACAACCCTTT ACCCAATCAC GATAATCATO 
m TTA3TAGGTA CTAAAGTAGC TGTTGGGAAA TGGGTTACTC C7ATTACTAC 



BanEI 



;3$01 A7AATTGCTG GACGGGCTGG AGACASTGGA TCCCTGCAGG AATAGGCATC 
TATTAACCAC CTGCCCGACC TCTGTCACCT AGGGACGTCC TTATCCCTAA 



§951 AC7GGAAT7A TTATTCCAAT CATTGCTC7T CTTTGCG7C? GCAAGCTCCT 
O TGACCT7AAT AA7AACGTTA GTAACGAGAA GAAACGCAGA CGTTCGACGA 



BamffI 



4001 TTG7TGAATA TCAGAATTCC AGCACTGGCG GCCGTTACTA GTGCATCCGA 
AACAACTTAT AGTCTT A ACG TCGTGACCGC CGGCAATGAT CACCTAOQCT 



Marl 



3«nRZ Xbal Kami fiaaHI 



4051 CCTCGGATCC AAGCTCTAGA CCAGGCCCCT GGATCCAGAT CTGCTGTGCC 
CGAGCCTAGG TTCCAGATCT CCTCCGCGGA CCTAGGTCTA GACGACACGG 



4101 TXCTACTTGC CAGCCATCTG TTCTTTGCCC CTCCCCCCTC CCTTCCTTCA 
AAGATCAACG GTCGGTACAC AACAAACGGG GAGGGGGCAC GCAAGGAACT 



4151 CCCTGCAAGG TCCCAMCCC ACTCTCCTTT CCTAA7AAAA TGXGGAAATT 
GGGACCTTCC ACGGTGACGG TGACAGGAAA GGATTATTTT ACTCCTTTAA 
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4201 CCATCGCATT CTCTGXGTAG OTGTCAMCT AWCTGGGGG CMGGGTGGG 
CCTAGCGTAA CAGACTCATC CACAGTAAGA TAAGACCCCC CACCCCACCC 



SphI 



4251 CCAGCACAGC AAGGCGGAGG ATTGGGAAGA CAATACCAGG CA-GCTGGGG 
CGTCGTGTCG TTCCCCCTCC TAACCCTTCT GTTATCGTCC CTACGACCCC 



Xpnz 



4301 ATGCGOTGGC CTCTATGGCT ACCCAGGTGC TCAAGAATTG ACCCGCTTCC 
TACGCCACCC GAGATACCCA TCGGTCCACG ACTTCTTAAC TGCGCCAAGG 



4351 TCCTGGGCCX GAAAGAAGCA GGCACATCCC CTTCTCWTG ACACACCCTG 
XCGACCCGGT CTTTCTTCGT CCGTGTAGCG GAAGXGXCAC TGTGTGGGAC 



4401 TCCXCGCCCC TGGTTCTTAG TTCCAGCCCC ACTCATACCA CACTCATAGC 
AGGTGCGGGG ACCAAGAATC AAGGTCGGCG TGAGTATCCT GTGACTATCG 



4451 TCAGGAGGGC TCCGCCTTCA ATCCCACCCG CTAAAGTACT TGGAGCCGTC 
AG7CCCCCCG AGGCGGAAGT TAGGGTGGGC GATTTCATGA ACCTCGCCAG 



4501 TCTCCCTCCC TCATCAGCCC ACCAAACCAA ACCTAGCCTC CAXGAGTGGG 
AGMGGAGGG AGTAGTCCGG TGGTrTGGTT CGGATCGGAG GTTCTCACCC 



4§51 AAGAAATTAA AGCAAGATAG GCTATTAAGT GCAGAGGGAG AGAAAATGCC 
fn TTCTTTAATT TCGTTCTATC CGATAATTCA CGTCTCCCTC TCTTTTACGG 



5J Xnn2 

4^01 TCCAACATCT GAGGAAGTAA TGAGAGAAA? CATAGAATTT CTTCCCCTTC 
m AGGTTCTACA CTCCTTCATT ACTCTCTTTA GTATCTTAAA GAAGGCGAAG 



4^51 CTCGCTCXCT GACTCGCTGC CCTCGGTCCT TCGGCTGCGG CGAGCGGTAT 
GAGCGAGTGA CTGAGCGACG CGAGCCACCA AGCCGACGCC CCTCGCCATA 

s • • 

CAGCXACTC AAXGGCGGTA ATACGGTTAT CCACAGAA7C AGGGGATAAC 
GTCGAGTGAG TOTCCGCCAT TA7GCCAATA GGTCTC7TAG 2CCCCTATTG 



1f751 GCAGGAAAGA ACA7GTGAGC AAAAGGCCAG CAAXAGGCCA GGAACCGTAA 
O CGTCCTT7CT TGTACACTCG TTTTCCGGTC GTCTTCCGGT CCTTGGCATO 



4801 AAAGCCCGCG TTCCTGGCCT TTTTCCATAG GCTCCGCCCC CCKACGAGC 
TTTCCGGCCC AACGACCGCA AAAAGGTATC CGAGGCGGCG GGACCCCTCG 



4851 ATCACAAAAA TCGACGCTCA ACTCAGACCT GGCGAAACCC GACACGACTA 
TAGT GlViW AGCTGCGAGT TCACTCTCCA CCGCTTTGGG CTGTCCTGAX 



4901 TAAAGATACC AGCC G TTT C C CCCTGGAXGC TCCCTCGTGC GCTCTCCTGT 
ATTOCTATGC TCCGCAAAGG CGGACCTTCG AGOOAGCACG CGACACGACA 



4951 TCCGACCCTG CCGCTTACCG GATACCTCTC CGCCTTTCTC CCTTCGGGAA 
AGGCTGGGAC GGCGAATGGC CTATGGXCAG GCCGAAAGAC GGAAGCCCTT 



5001 GCGTGGCGCT TTCTCAATCC TCACCCTGTA GGTATCTCAG TTCGGTGTAG 
CGCXCCCCGA AAGAGTTACG ACTGCGACAT CCATAGACTC AAGCCACATC 
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5051 CTCGTTCCCT CCAAOCTGGG CTCTCTGCAC OACCCCCCC TTCAGCCCGA 
CAGCAAGCGA GCTTCGACCC CACACACGTG CTTGGGCGGC AAGTCCCCCT 



5101 CCGCTGCCCC TTATCCGGTA ACTATC07CT TCAGTCCAAC CCGGTAAGAC 
GGCCXCGCGG AATAGGCCAT TGATAGCAGA ACTCAGCTTG GGCCATTCTG 



5151 ACGACTTATC CCCACTGGCA GCAGCCACTC GTAACAGSAT 7AGCACACCG 
CGCTGAATAG CGGTGACCOT CGTCGG7GAC CATTGTCCTA ATCCTCTCGC 



5201 AGCTAXGXAG GCGGTGCTAC AGAGTTCTTG AAGTCGTGGC CTAACTACGG 
TCCAIACATC CGCCACGATG TCTCAAGAAC TTCACCACCC GATTGATGCC 



5251 CTACACCAQA AGGACAGTAT TTGGXATCTG CGCTCTSCTG AACCCACTTA 
GATCTGATCT TCCTGTCATA AACCATAOAC GCGACACGAC TTCGGTCAAT 



5301 CCTTCCGAAA AAfiAGTTGC? AGCTCTTGXT CCGGCAAACA AACCACCCCT 

ccxAGccrrr wctcaacca tcgagaacta ggccctttg? ttcgtggcga 



5351 GGTAGCGGTC GTTTTTTTGT TTGCXACCAG CAGATTACGC GCAGAAAAAA 
CCATCGCCAC CAAAAAAACA AACGTTCGTC GTCTAATGCC CGrTCTTTTTT 



5401 ACGATCTCXA GAAGATCCTT TGAKTTTTC TACCGGGTC7 CACGCTCAG? 
TCCTAGACTT CTTCTAGGAA ACTAGAAAAC ATGCCCCAGA CTGCGAGTCA 

851 GGAACGAAAA CTCACGTTAA GGGATTTTGG TCAMAGATT ATCAAAAAGG 
CCTTGCTTTT GAGTGCAATT CCCTAAAACC AGTACTCTAA TAGTTTTTCC 

fjSQl ATCTTCACCT AGATCCTTTT AAA77AAAAA TCAAGTTTTA AATCAATCTA 
]zf TAGAACTGGA TCTAGGAAAA TTTAAT7TTT ACTTCAAAAT TTAGTTAGAT 



SS51 AAGTATATAT GACTAAACT7 CCTCTGACAG TTACCAATGC TTAATCAGTG 
m TTCATATA2A CTCATTTGAA CCAGACTCTC AATGGTTACG AATTAGTCAC 



S€Q\ AGGCACCTAT CTCAGCGATC TGTCTATTTC GTTCATCCAT AGTTGCCTGA 
1^ TCCCTCGATA CAGTCGCTAG ACAGATAAAG CAAGTAGGTA TCAACGGACT 



||51 CTCCGGGGGG CCCGGGCCCT GAGGTCTGCC TCGTCAAGAA GCTGTTGCTG 
1^ CAGGCCCCCC CCCCCCGCGA CTCCAGACGG ACCACTTCT? CCACAACCAC 



5701 AMCATACCA GGCCTCAATC GCCCCATCAT CCAGCCAGAA AGTGAGGGAG 
jUJ TGAGTATGGT CCCGACTTAG CGGGGTAGTA GG7CGGTCTT TCACTCCCTC 



5751 CCACGGTTGA TGACAGCTTT GTTGTAGGTG GACCAGTTGG 7GAWTTGAA 
GGTCCCAACT ACTCTCGAAA CAACATCCAC CTGGTCAACC AC7AAAACTT 



5«01 C7XCTGCTTT CCCACGGAXC GGTCTCCQTT GTCCCGAAGA TCCGTCATCX 
GAAAACCXAA CGGTGCCTTG CCAGACGCAA CAGCCCTTCT ACGCACTAGA 



5S51 GATCCTTCAA CTCAGCAAAA GTTCGATTTA CTCAACAAAQ CCGCCCTCCC 
CTAGGAA5TT GAGTCCTTTT CAXCC7AAAT AAGTTGTTTC GGCGGCAGGG 



5901 GTCAAGTCAC CGTAATGCTC TGCCACTGTT ACAACCAATT AXCCAATTCT 
CAGTTCAG2C GCATTACGAG ACCGTCACAA TGTTGGTTAA TTGGTTAAGA 



5951 GATTACAAAA ACTCATCGAG CATCAAATCA AACTGCAATT TATTCATATC 
CTAA3CTTTT TCAGTAGCTC GTAGTTTACT TTGXCGTTAA ATAAGTATAG 
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«001 AGCATTOICA ATACCATATT TTTGAAAAAC CCQTTPCTCT AA7GAAG0AG 
TCCTAATAOT TATOGTATAA MACTTTTTC GCCAAAOACA 7TACXTCCTC 

€051 AAAAC3CACC GAOOCAGTTC CAXACCATOG CAAGATCCTO GTA7CGCTCT 
TTTWASTCC CTCCCTCAAG CTA7CCTACC CMCTACGAC CATAGCCAfi A 

€101 CCGATTCCGA CTCGTCOOC ATGAA7ACAA CC7A7TAATT TCCCCTCCTC 
CGC7AAGGCT CAGCACC7TG 2ASTTA7GT7 CGA7AA77AA AGGGGAOCAG 

6151 AAAMTAAGG TTATCAAfiTC AOMATCACC ATCXCTGACC ACTGAA7CCG 
T7TTTATTCC AATAGTTOC 7CTTTAGTGG TACTCACTCC TGACTTAGGC 

HiftdlH 

€201 CTGAGAATGO CXAAAGCTTA TGCATTTCTT TCCAGACTTO 7CCAACAGCC 
CACTCTTACC OTTTT C CAAX ACCTAAAGAA ACGTCTGAAC AAG7TGTCCG 

€251 CACCCXTXAC CC7CGTCA2C AAAATCAC7C GCATCAACCA AACCGT7A7T 
GTCCCTAATG CGAGCAG7AG T77CAGTGAG CCTAGTTGC7 TTGGCAATAA 

Pvul 

6301 CATTCCTOAT 7CCCCCTGAC CGAGACCJULX CACGCGATCG CTCTT AAAAG 
CTAAGCACCA ACGCGGXCTC CCTCTGCTTT ATGCGCTACC GACAA77CTC 

^351 GACAA7TACA AACAGGAATC GAA7GCAACC GGCGCAGGM, CACTGCCAGC 
Cl C7CTIAA7CT T7GTCCTCAG C7TACGTTGG CCGCGTCCTT GTGACGGTCG 

?€401 GCATCAACAA TATTXTCACC TGAATCAGGA TATTCT7C7A ATACCTGGAA 
CGTAGTTCTT ATAAAAGTGG AC7TAGTCC7 A7AAGAAGAT 7ATGGACCT? 

it451 TGCTCTTTTC CCCGGGATCG CAG7GGTCAG TAACCATGCA TCATCAGGAG 
01 ACGAC&AAAG CGCCCCTAGC GTCACCACTC ATTGCTACGT ACTAGTCCTC 

CS501 TACGGATAAA ATCCTTGATC C7CGGAAGAG GCATAAATX CGTCAGCCAG 
A7GCCTATTT 7ACGAAC7AC CAGCCTTCTC CGTATTTAAG GCAGTCGGTC 

r ~<531 77TASTCTGA CCATCTCA7C TGTAACATCA TTGGCAACGC TACCTTTGCC 
AAA7CAGAC7 CG7AGAG7AC ACATTCTAGT AACCCTTCCG ATGCAAACGG 

6€01 ATCTTTCAGA AACAACTCTG GCGCATCGGG C7TCCCATAC AA7CGA7ACA 
TACAAAC7CT TTGTTGAGAC CGCGTAGCCC GAAGGGTATG TTAGCTXTC7 

£651 C7GICCCACC 7GATTGCCCC ACATXATCCC GAGCCCATTT ATACCCATAT 
AACAGCCTGG ACTAACGGGC TGTAATAGCC CTCGGCTAAA TATGGCTATA 

6701 A&A7GJIGCA? CCATGTTGGA AT7TAATCGC GGCC7CGAGC AACACGTTTC 
WTAGICG7A GGTACAACCT TAAATTAOCG CCGOAGCTCC 7TC7GCAAAG 

€751* CCCTTG3^7A TGGC7CATAA OUXECTTGT A77ACTGTTT ATGTAAGCAG 
CKMCTTA5 ACCGA07AT7 C7GGGGAACA 7AATGACAAA TACATTCOTC 

€001 ACAGTTTTAT TGTTCATGA7 CATATATTTT TATCCTG7GC AATG7AAGAT 
TGTCAAAA7A ACAAG7ACTA CTA7A7AAAA ATAGAACACG TTACA7TGTA 
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f S51 CACXPTTTT GAGACACAAC CTGGCTTTCC CCCCCCCCCC A7TX3TGAAG 
CTCTOTUAA CTCTOTCTTC CACCCXAAGG CCCGGGGCGC TAATAACTTC 



€901 CAT7Zft?GAC GGTTATTOTC TCXMAGCGC ATACATXTTT GAATCTATTT 
CTAAAIACTC CCAATAACAC AOTAMCGCC TAMTATAAA CT7ACATXAA 



$951 AGAAAAATAA ACAAATAGGG CTTCCGCCCA CATCTCCCCG AAAAGTGCCX 
TCTXTTTACT TGTTTAXCCC CAAGGCCCGT GTAAAGGGGC TTTXCACCCT 



7001 CCTCACOTCT AAGAAACCAT TATTATCATG ACATTAACCT ATAAAAATAG 
CGACT O CXG A TI C TTTGOTA ATAATAGTAC TGTAATTGGA TATOTTTATC 



70S1 GCGTAXCACG AGCCCC7TTC GTC 
CGCATAGTGC TCCGGGAAAG CAG 
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SEQUENCE LISTING ID NO: 3 

General Description 

SKJl PVR 1012 • CP (r) 
local object; 

Creafcftdt 09/15/9S 05:06PM 
Last Modification Date: ? (no data} 
lerctfcs 7285 bp 
st cray* type; Baaic 
fonr.i Circular 
,ts 



Restriction Map 

Dralfclsfte g*g**g£ 
Hindiflrisfte gggg; 
Hpaklsite g^g 
KasI* 1 she cccccc 

Narfclshe cc^mo 

Not!* 1 £h& CCGGCCCC 

Pmlfclsite 
pvubiste g™g[ 
Saclfclsae cggg 
XbaUlsite «A 
XholMsite §£«» 
EcoRV:2s«tes 
NcoL2 sites gggg 
Ndel:2s»es gggjg 
Sphl:2 sites gggg 

Functional Map 

COS {4 signals) 

CMV1E 5* UT 

Start i BBC Ends 1129 
CMVIEINT 

Starts 1130 Endt 1840 
TbGH 

Start: 4302 E=dt 4854 
Kanr 

Stert: €3S0 Sndt 6972 (Conpleoentary) 
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MtSG_fe*ture (2 signal*) 
CMV enhancer 

Starti 24B tad: 885 
Startt 1870 find: 4301 

Annotations 
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1 TCGCCCGTTT CCGTOATGAC GGTGAAAAC C TCTCACACAT GCACCTCCCG 
ACCG C GCAAA GCCACTACTG CCACTTTTGG AGACTGTGTA COTCGAGCCC 

51 GAOC OGTCA CAGCITOTCT OTAAOCGGAX GCCCGGACCA GACAAGCCCO 
CTCTGCCAGT GXCGAACAGA CACTCGCCTA CGGCCCTCOT CTCTTCCGOC 

101 T C AGG C C GCC 7CAGCCCGTG TTCGCGGGTG TCGGGGCTCG CTTAACTATG 
AGTCCCGCGC AGTCGCCCAC AACCGCCCXC AGCCCCCACC GAATTGATAC 

151 OgCATCACA GCAGA»G<2A CTCXGAGTGC ACCATATGCG GTCTGAAATA 
COCCiaCTCT CGTCTAAGAT GACTCTCACG TGG7ATACGC CACACTT7AT 

201 C CGCACACA T GCCTAAGOAG AAAATACCGC ATCAGAWGO C7ATTGGCCA 
GGCGTCTCTA CCCATTCCTC TXTTATCGCG TAGTCTAACC CAffAACCGG? 

251 TTCCATACCT TGTATCCATA SCATAATATC TACATTTATA TTCCCTCATC 
AACGXATGCA ACATAGGTAT AGTATTATAC ATGTAAATAT AACCGACTAC 

301 TCCAACATTA CCGCCATGTX GACATTGATT ATTGACTAGT TAMAATAGT 
AGGTTCTAAT GGCGGTACAA CTGTAACTAA 7AACTGATCA ATAA77ATCA 

.351 AATCAATOAC GGGGTCATTA G7TCATAGCC CATA7ATGGA CTTCCGCCT7 
t STACTTAATG CCCCAGTAAT CAACTA7CGG CTATATACCT CAAGGCGCAA 

1401 ACATAACTTA CGGTAAATGG CCCGCCTGGC 7GACCGCCCA ACGACCCOCG 
7G7ATTGAAT CCCATTTACC GGCCCGACCC ACTGGCGGGT 7CCTGGGGGC 

!<51 CCCATTGACG TCAATAATCA CGTATCTTCC CATAGTAACG CCAATAGGGA 
'j GGGTAACTGC AGTTATTACT GCATACAAGG GTATCAKGC GCTTATCCCT 

1501 C7T7CCATTC ACGTCAATGG GTCGAG?ATT TACGGTAAAC 7GCCCACTTC 
GAAAGGTAAC TGCAGTTACC CACC7CATAA ATGCCATTTG ACGGGTGAAC 

NdeX 



s 551 GCAGTACATC AAGTGTATCA TATGCCAAGT ACGCCCCCTA TTGACGTCAA 
CGICA3OTAG TTCACATAGT ATACGGTTCA 7GCGGGGGAT AAC7GCAGTT 



= €01 TGACCGTAAA TCGCCCCCCT GGCATTATCC CCAGTACATG ACCTTATGGC 
ACTGCCATTC ACCCGGCGGA CCGTAATACG GGTCATG7AC TCGAATACCC 



Ncol 

€51 AC2TTCCTAC TTGGCAGTAC ATCTACGTAT TAGTCATCGC TATTACCATG 
TGAAACGATG AACCGTCATG TAGATCCATA AXCAGTAGCC ATAATGCTAC 



KeoX 

701 GTGATGCGGT TTTGGCAGTA CATCAATGGG CG7CGATAGC GGTTTGACTC 
CACTACGCCA AAACCOTCAT GTACTTACCC CCACCTAICQ CCAAACTGAG 



751 ACCCCGATTT CCAAGTCTCC ACCCCATTGA CGTCAATCGG A G Z 'X T GTTIT 
TCCCCC7AAA GGT3CAGAGG TCGGGTAACT GCAGTTACCC TCAAACAAAA 



801 CGOUCCAAAA TCAXCGGGAC 7T7CCAAAAT GTCGTAACAA CTCCGCCCCA 
CCGTCCTTTT AGOTGCCCTC AAAGCWTTA CAGCAT7GTT GAGGCGGGCT 
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•SI TTGACGCAAA TOGGCGOTAO GCC7QTAC0G TCGGAGGTCT ATATAAGCAC 
AACMCOTTT ACCCGCCATC CGCACATCCC ACCCTCCAGA 7ATATTCGTC 



$01 AGCTCCTTTA CMAACCGTC AGATCGCCTG OAOACCCCAT CCACCCTGW 
TCGACCAAAT CACTTGCCAG TCTAGCCGAC CTCTCCOGTA GCTGCGACAA 



SaeXX 



S51 CTGACCTCCA TAGAASACAC CGGGACCGAT OCAGCCTCCO CGGCCCGGAA 
A&CTGGASGT ATCTTCTGTO CCCCTGGCTA GGTCGCAGGC GCCGGCCCTT 



1001 CGGTGCATTG GAACCCGGA7 TCCCCGTCCC AAGAGTCACC TAACTACCGC 
GCCACCTAAC CTTGCGCCTA AGGGGCACGG TTCTCACTGC ATTCATGGCG 



CphI 



2051 CXATAGXCTC TATAGGCACA CCCCTTTCGC TCTTATGCAT GCTATACTGT 
CA2ATCTCAG ATATCCCCGT GGGGAAACCC AGAATACGTA COATATGACA 



1101 TTtTCGCTTG CGGCCTATAC ACCCCCGCTT CCTTATGCTA TAGGIGATGC 
AXAACCGAAC CCCGGATATG TCGCCGC6AA CGAATACGAT ATCCACTACC 



1151 7ATAGCTTAG CCTATASSTQ TGGGTTATTG ACCATTATTC ACCAC7CCCC 
ATATCGAATC GGATATCCAC ACCCAATAAC TGGTAATAAC TGGTGACGGG 



WtlQl TATTGGTGAC GATACTTTCC AOTACTAATC CATAACAKG CTCTTTGCCA 
ffl ATAACCACTG CTATGAAAGG TAATGATTAG GTATIGTACC CAGAAACGGT 



S%1S1 CAACTATCTC TATTGGCTAT ATGCCAATAC TCTCTCCTTC ACAGACTGAC 
H GTTGATAGAG ATAACCGATA TACGGTTATG AGACAGGAAG TCTC7GACTG 



0lt301 ACCGACTCTC TATTTTTACA GCATGGGGTC CCATCTATOA TTXACAAATT 
f?! KCCTCAGAC A7AAAAATGT CCTACCCCAG GGTXAATAAT AAATGTTTAA 



LllSl CACATATACA ACAACGCCCT CCCCCCTGCC CGCAGT7TTT ATTAAACATA 
%J GTCf ATATGT TGTTGCCGCA GGGCGCACGG CCG7CAAAAA TAATTTGTAT 



1401 GCGTGGGATC TCCACGCGAA TCTCGGCTAC GTGTTCCGGA CATGGGCTCT 
> CGCACCCTAG AGGMCCCTT AGAGCCCATC CACAAGGCCT GTACCCCAGA 



^1451 TCTCCGCTAG CGGCGGAGCT TCCACATCCG AGCCCTGGTC CCATGCCTCC 
M= AGAGCCCA1C CCCGCCTCGA AfiCTGTAGGC TCGGGACCAG GGTACGGAGG 



1501 AGCGCCTCAT CCTCGCTCGG CAGCTCCTTG CTCCTAACAG TGGAGGCCAG 
TCCCCGACTA CCAGCCAGCC GTCGAGGAAC GAGGA7TC7C ACCTCCGGTC 



1551 ACWAGGCAC AGCACAATCC CCACCACCAC CA G TQT GC CC CACAAGCCCG 
TGAATCCOTG TCGTCTTACG GGTGGTGGTG GTCACACCGC GTGTTCCGGC 



1*01 TGGCGCTAGG GTATGTGTCT GAAAATCXGC GTGGACATTG GGCTCGCACG 
ACCCCCATCC CATACACAGA CTTTTACTCG CACCTCTAAC CCGAGCCTGC 



1C51 GCTGACGCAC ATGGAAGACT TAACGGAGCG CCAGAAQAAG ATGCAGCCAG 
CGACTGCCTC TACC7TCTGA ATTCCGTCCC CGTCTTCTTC TACCTCCGTC 



1701 CTGAGTTGCT GTATTCTGAT AAGAGTCAGA GGTAACTCCC GTTGCGGTGC 
GXCICAXCAA CATAAGACTA TTCTCACCCT CCATTGAGOG CAACGCCACG 
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JtptX 



1751 TGTTAACGGT CG A CCGC A GT CTAGTCTOAG CAGTACTCGT TCCTCCCGCG 
ACAATTGCCA CCTCCCOTCA CATCAGACTC CTCATGAGCA ACGACGGCGC 



Ncor 

1B01 CGCGCCACCA GACA7AATAG CTCACAGACT AACAGACTGT TCCTTTCCAT 
CCCCGGTGGT CMTATTXTC GACTGXCTCA tTTGTCTCACA AGSAAACCTA 



tail EcoRVNotl 



1851 CGGTCTTTTC TGCACTCACC GTCCTCCACA CCTGTGATCA GA7ATCGCGG 
CCCACAAAAG ACCTCAOTGC CAGCAGCTCT CCACACTAGT CTATAGCCCC 



Karl 



1901 CCGCTCTAGA CCAGGCGCCT GGATCGATCC GCGATCAACA TTAACCCGAC 
GGCCAGATCT CG7CCGCCCA CCTACCTAGG CGCTACTTCT AATTCGGCTG 



1951 AGTCAGCGXA ATCTrCAXCT CTCT7AGATT ATTTC7TTTC CAGAGTAGCC 
TCACTCGCAT TAGAAGTAGA GAGAATCTAA TAAACAAAAG CTCTCATCCC 

CJOOl CTCCTCAGGT CCTTTTCAAT CGTGTAACCA AAATAAACTC CACTAGAAGG 
CAGCAGTCCA GCAAAAGTTA GCACATTGGT TT7ATTTGAG GTGATCTJCC 

S051 ATATXGTGGG CCAACAACAC AATGGGCGTT ACACGAATAT TCCAGTTACC 
TATAACACCC CG7TGTTGTG ~ACCCGCAA TGTCCTTATA ACGTCAATCG 

"£-01 TCCTGATCGA TTCAACAGGA CATCATTCTT TCTTTGGGTA ATTATCCTTT 
fjl AGCAC7AGCT AACTTCTCCT GTAGTAACAA AGAAACCCAT TAATAGGAAA 

1^51 TCCAAAGAAC ATTCTCCATC CCACTTCGAG 7CATCCACAA TACCACACTA 
f AGCTTTCTTG TAAAAGGTAG GGTCAACCTC AGTAGGTCTT ATCGTGTAAt? 



12201 CACCTTAGTC ATGTCCACAA ACTAOTTTGT CCMACAAAC TGTCATCCAC 

1^ CTCCAATCAC TACAGCTGTT TGATCAAACA GCACTGTTTG ACACTAGGTG 

'5251 AAATCAA7TG AGATCAGTTG GACTGAATCT CGAACGGAAT GGAGTGGCAA 

U TTTACTTAAC TCTAGXCAAC CTGACTTAGA CCMCCCTTA CCTCACCCTT 



2301 CTGACGTGCC ATCTGCAACT AAAACATGGG CCtTCAGGTC CGGTGTCCCA 
CACTGCACCG TAGACGTTGA TT7TCTACCC CGAACTCCAG GCCACAGGGT 



2351 CCAAAGGTGG 7CAATTATGA ACCTGGTGAA TGGGCTGAAA AC7GCTACAA 
GGTTTCCACC AGTTAATACT TCGACCACTT ACCCGACTTT TGACGATGTT 



2401 TCTTGAAATC AAAAAACCTG ACGGGAGTGA GTCTCTACCA GCAGCGCCAG 
AGAACTTTAG TTTTTTGGAC TCOCCTCACT CACAGA2GGT CGTCGCGOTC 



2451 ACGG6ATTCG GGGCTTCCCC CGOTGCCCGT A2PCTCCACAA ACTATCAGGA 
TGCCCTAACC CCCGAAGGGG GCCACGGCCA 7AGACGTGTT TCATAGICCT 



2501 ACGGGACCCT GTGCCCGAGA CTTTGCCTTC CATAAAGAGG OTGCTTTCTT 
TGCCCTGGCA CACGGCCTCT GAAACCGAAG GTATMCTCC CACGAAAGAA 
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2SS1 CCTCTATCAT COACTTCCTT CCACAGMAT CfACCGAGGA ACGACTTTCC 
GGACAT ACTA GCTGAACOAA GGTGTCAAXA GATGGCTCCT TGCTGAAAfl C 



2601 CTGAAGGTCT CGTTGCATTT CTGATACTGC CCCAAGCTAA GAAGGACTTC 
CACTTCCACA GCAACCTAAA GACTATGACC CGG2TCGATT CTTCCTCAAG 



2651 TTCAGCTCAC ACCC C TTGAG AGACCCGGTC AATGCAACGG AGGACCCCTC 
AAGTCCACTG TGCCGAACTC TC7CGGCCAQ TTACGTTGCC TCCTCCGCAG 



ECORV 



2701 TACT GGC T A C TATTCTACCA CAATTAGATA TCAGGCTACC GGT7TTGCAA 
ATCACCGATG ATAACATGGT CTTAATCTAT AGTCCGATCG CCAAAACCTT 



2751 CCAATGAGAC AGAGTACTTG TfCGAGGTTC ACAASTTGAC CTACCTCCAA 
CGTTACTCTG TCTCATGAAC AAGCTCCAAC TGTTAAACTG GATCCAGGTT 



2801 CT7GAASCAA GATTCACACC ACAGTTTCTC CTCCAGCTGA ATGAGACAAT 
GAACWAGTT CTAAG7G7GG TGTCAAAGAC GAGCTCGACT TACTC7GTTA 



2851 ATATACAAGT GGGAA&AGCA GCAA7ACCAC CGGAAAACTA AT77GGAAGG 
TATATGTTCA CCCTTTTCC? CGTTATGGTC CCCTO7CAT TAAACCTTCC 



M 01 SCAACCCCGA AATTGXTACA ACAATCGGGG AGTGGGCCTT CTGGGAAACT 
ACTTGGGGCT TTAACTATGT TCTTAGCCCC TCACCCGGAA GACCCTTTGA 



§951 AAAAAAAACC TCACTAGAAA AATTCGCAG7 GAAGAGT7CT CTTTCACAGT 
Hi TTTO7TTGG ACTGATCTTT TTAAGCGTCA CTTCTCAACA GAAAGTGTCA 



3t>01 TGTATCAAAC GGACCCAAAA ACAXCAGTCG 7CAGAGTCCG GCGCGAACT7 
ACATASTTTC CCTCGGTTT? TGTAGTCACC AC7CTCAGGC CGCGCWGAA 



SCSI CTTCCCACCC AGGGACCAAC ACAACAACTG AAGACCACAA AATCXTGGCT 
GAAGGCTGGG TCCCTGGTTG T C TTCTTGAC TTCTGGTCTT TTAGTACCGA 

="Il01 7CAGAAAATT CCTCTCCAA? GCTTCAAGTG CACAGTCAAG GAAGCGAAGC 
U I ACTCTTTTAA GGAGACGTTA CCAAGTTCAC GTGTCAG7TC CTXCCCTTCG 



3151 TGCAGTGTCG catctaacaa cccttgccac aatctccacg actccccaa? 
ACGTCACAGC GTAGATTGTT GGGAACGGTG TTA6AGGTGC TCAGGGGTTA 



4201 CCCICACAAC CAAACCAGCT CCGGACAACA GCACCCATAA TACACCCGTG 
GGGAGTGTTC GTTTCCTCCA CGCC7G7TCT CGTGGGTATT ATGTGGGCAC 



3251 TATAAACTCG ACATCTCTGA GGCAACTCAA GTTGAACAAC ATCACCGCAG 
ATATKGAAC TCTAGA6AC? CCGTCGAOTT CAACWGTTC TAGTGGCCTC 



3301 AACAGACAAC GACAGCACAG CCTCCGACAC TCCCTCTGCC ACGACCCCAO 
TTGTCTGTTG CTCTCGTGT C GGAGGCTCTG AGGGA&ACGG TGCTGGCCTC 



3351 CCGGACCCCC AAAAOCAGAG AACACCAACA CGAGC AAGAO CACTGACTTC 
GCCCTGGOGG TVPTOC T MC TTCTGGTTGT CCTCGTTCTC GTGACTGAAG 



3401 CTGGACCCCG CCACCACAAC AAGTCCCCAA AACCACAGCG AGACCGCTGG 
GACCTGGCGC GG7CGTG7TG TTCAGGGGTT TPGGTGTCCC TCTGGCGAOC 
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34S1 CAACAACAA C ACTCATCACC AACATACCGG AGAACAGAGT GCCACCAGCG 
CTTCrTCTTO TGACTAOTGG TTCTAMGCC TCTTCTCTCA CCGTCGTCGC 



3501 GGAAGCSAGO CTTAATTXCC AATACTATTG CTCOAaTCGC ACGACTCATC 
CCTTCGATCC CAATTAATCQ TTATGATAAC OACCTCACCG TCCTCACTA3 



3551 ACASGCGGGA CAASAACTCG AAGAGAAGCA ATTGTCAATG CTCAACCCAA 
TC gCCG CCCT CTTCTTGAflC TJCTCTTO» T TAACAGTTAC CAGTTGGG7T 



3$01 XTCCA ACC CX AATTTACATT ACTCGACTAC TCAGCATQAA 6GTGCTGCAA 
MCGTTOGCA TEAAATCTAA TGACCTGATG AGTCCTACTT CCACGACGTT 



3651 TCCGATOCCC CTGGATACCA TATTCCCGGC CAGCAGCCQA GGCAATTTAC 
A SCCTG ftCCG GACCTATGGT ATAAAGCCCG CTCOTCGGC? CCCTTAAATC 



3701 X SAGASC CCC TAATCCACAA TCAAGATGCT CTA A ICT G TG GGTTGAGACA 
TATCTCCCCG ATTACGTCTT AGCTCTACCA AASTAGACAC CCAACTCTGT 



3751 GCTGGCCXAC GACACGACTC AAGCTCTTCA ACTGTTCCTG AGACCCACAA 
CGACCGGTTG CTC7 GC T GA G 7TCGAGAAGT TGACAAGGAC SCTCGGTGTT 



3801 CSGAGCTACO CACCTTTTCA ATCCTCAACC CTAAGGCAAT »GATTTCTTG 
GACTCGATGC GTCGAAAAGT TAGGAGTTGG CATTCCCTTA ACTAAAGAAC 



3851 CTGCAGCGAT GGGGCGGCAC ATGOCACATT CTGGGACCGG ACTCCTGTAT 
GACCTCGCTA CCCCGCCCTG TACGCTGTAA GACCCTGGCC TGACGACATA 



39C1 COAACCACAT GATTCCACCA AGAACATAAC AGACAAAAT? GATCACATTA 
^ GCTTCCTGTA CTAACCTGGT TCTTGTACTG TCTCTWTAA CTACTCTAAT 



^}351 TTCAKAJTT TCTTGATAAA ACCCTTCCCG ACCAGGGGGA CAA7CACAA? 
01 AW3TACTAAA ACAAKATTT TGGCAAGGCC TGCTCCCCCT CTTACTGTTA 



4001 TGGTCGACAG CATGGAGACA ATGGA7ACCG GCAGCTATTG GAGTTACAGG 
ACCACCIGTC CTACCTCTGT TACCTA7GGC CGTCCATAAC CTCAATGTCC 



m05X CGWATJATT GCAGTTATCG CTTTATTCTG 7ATATGCAAA TTTGTCTTTT 
y, GCAATATTAA CG7CAATAGC GAAATAAGAC XXATACGTTT AAACAGAAAA 



3101 AGTTTTTCTT CAGATTGCTT CATGGAAAAG CTCAGCCTCA AATCAATCAA 
%J TCAAAAACAA GTCTAACGAA CTACCTTTTC GAGTCGGA6T TTAGTTACTT 



€131 ACCAGGATTT AATTATATCG ATTACTTGAA TCTAAGATTA CTTGACAAAT 
TGGTCCXAAA TXAATATACC TAATOAACTT AGATTCTAAT GAACTCTTTA 



4301 GATAATATAA TACACTGGAG CTCTAAACAT AGCCAATCTG A7TCTAACTC 
CTATTATATT ATQTGACCTC GAAA7TTCTA TCGGTTACAC TAAGATTGAG 



4351 CTTTAAACTC ACAGTTAATC ATAAACAAGG TTTOGTACCG AGCTCGAAW 
GAAATTTGAG TCTCAAXTAG TATTTGTXCC AAACCATCGC TCGAGCTTAA 



4301 ATCTCCTGTG CC TTCTASTT GCCAGCCATC TGTTGTTTGC CCCXCCCOCG 
TAGACGACAC GGAACATCAA CGCTCGGTAG ACAACAAACG GGGAGGGGCC 



4351 TOCCTXCCTT GACCCTGGAA GGTCCCACTC CCACTGTCCT TTCCTAATAA 
ACGGA&CGAA CTGGGACCTT CCACGGTGAC GGTGACACCA AAGGAWATT 
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4401 AATCXGOAAA TTQ CA TCOCX TT O T C TOAOT ACCTGTCATT CTATTC7GCC 
WACTCCTTT AACCTAGCGI AACAGACTCA TCCACAGTAA GATAACACCC 



44S1 CGGTGGGCTG GGGCXCCACA QCXXOOQOOA. CGATTGGCAA OACAATACCA 
CCCSCCCCXZ CCCGTCGTGT COTTCCCCCT CCTAACCCTT CTCTTATCGT 



Sptl 

450X CCCKXCCCQ GGWCCGOTO CCCTCTATCG GTACCCACCT GCTGAACAAT 
CCC23CCACC CCTACOCCXC CCGAGATACC CATGGCTCCA CCACTTCTTA 



4S51 TGftCCCCOTT CCTCCTQGCC CAGAAACAAG CASGCACA T C CCCTTCTCTC 
ACTGGGCCAA GGAGGACCCG GTC m CT T C GTCCGTGTAG GGCAAGAGAC 



4*01 TOACACACCC TCTCCACGCC CCTG G TT C TT AGTTCCAOCC CCACTCATAG 
ACTG2GTGGG ACAOGTCCGO GGACCAXGAA TOOGGTCG6 GCTGAGTXTC 



4451 GACXCTCATA GCTCAGGACG GCTCCGCCT? CAATCCCACC CGttAAAGIA 
CTGTGASXAT CGAGTCCTCC CGAGGCGGAA GTTAGGGTGC CCGA7TTCAT 



4701 CTTGGAGCCa TCTWCCCTC CCTCATCAGC CCACCAAACC AAACCTAGCC 
GAACCTCOCC AGAGAGGGAG GGAGTAGTCC CGTGC7TTCC5 TTTGGATCGG 



4751 7CCAACAGTG GGAAGAAATT AAAGCAAGAT AGGCTATTAA GTGCACAGGG 
O AGGTTCTCAC CCXTCTTTAA TTTCGTTCTA TCCCATAATT CACGTCTCCC 



pOl AGAGAAAATG CCMCAACAT CTGAGGAACT AATGAGAGAA ATCATAGAAT 
^ TCTCTTTCAC CGAGGTTCTA CAC7CCTTCA TTACTCTC7T TAGTATCTTA 



lISl rrCTTCCGCT TCCTCGCTCA CTGACTCCCr GCGCTCGGTC GTTCGGCTGC 
M AAGAAGGCGA AGGAGCCAGT GACTGAGCGA CGCGAGCCAC CAAGCCGACG 



OfjOl CCCCASCGGT ATCAGCTCAC TCAAAGGCGG TAATACGG77 ATCCACAGAA 
'~ CCGCTCCCCA TAGTCGAGTG AGTTTCCGCC ATTATGCCAA TAGGTGTCTX 



■llsi TCAGGGGATA ACGCAGSAAA CAACATCTCA GCAAAAGCCC AG CAAAAG GC 
|i] AGTCCCCTAT TCCGTCCTTP CTTGTACACT CGTCTTCCGG TCGTTTTCCG 



*001 CAGGAACCGT AAAAAGGCCC CGTTGCTCGC GTTTTTCCAT AGGCTCCGCC 
S GTCCT7GGCA WmC CCGC GCAACGACCG CAAAAAGC7A TCCGAGGCGG 



HSOSI CCCCTGACGA GCATCACAAA AATCGACGCT CAAGTCACAG GTGGCCAAAC 
GGGGACTGCT CCTAGTGTTT TTAGCTCCGA GTTCAGTCTC CACCGCTTTG 



5101 CCGACAGGAC TATAAAGATA CCAGGCCTTT CCCCCTOGAA GCTCCCTCCT 
CCCTGTCCTC ATATTTCTAT GGTCCGCAAA CGCGGACCTT CGAGGGACCA 



S151 CC GCTCTCCT GTTCCGACCC TOCCGCTTAC CGGATACCTG TCCGCCTTTC 
CGCGAGAGOA CAAGGCTGGG ACGGCCAATG GCCTATGGAC AGGCGGAAAG 



5201 TCCCTTCGGG AAGCGTGGCC CT77CTCAAT GCTCACGCTG TAGCTATCTC 
AGCGAAGCCC TTCGCACCOC GAAAGAGTTA CGAGTGCGAC ATCCATAGAG 



5251 AGTTCGC1CT AGGTCGTTCG CtCCAAGCTG GCCTOTGTGC ACGAACCCOC 
TCAAGCCACA TCCAGCAAGC GAGGTTCGAC CCGACAHACQ TGCTTGGGGG 
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5301 C GTTCA CCC C CA CC Q C T GC C CCT TA TC C CG TAACTATCGT CXTCACTCCA 
GCAAGTCGGG CTGGCGACGC GGAASAGGCC ATTGATAOCA GAACTCACCT 



5151 ACCCGG7AAC ACXCGACTTA TCCCCXCTGG CAGCAGCCAC TGGTAACACO 
5CCGCCATTC TGTGCTGAAT ACCGG7GACC CTCOTCCCTO ACCATTCTCC 



5401 ATTAGCAGAG CGAGCTATGT ACGCGOTOCT ACAGAGTTCT TGAAGTCGTG 
TJLATCCTCTC GCTCCAIACA TCCGCCACCX TGTC7CAAGA ACTTCACCAC 



5451 GCCSAAMAC GGCTACACTA CXAGGACAC? ATTTGCTA5C TGCGCTCTGC 
CGGAWGATC CCCATGTCM? CTTCCTCTCA TAAACCATAG ACCCGAGACG 



5501 TGAACCCAGT TACCTTCCCA AAAAGAGTTG GTACCT C TT G ATCCGGCAXX 
ACTTCGCTCA ATGCAACCg? TTTTCTCAAC CXTCGAGAXC TAGGCCGTTT 



5551 CAJUCCACCG CTGGTACCCG TGCTTTTTTT GTTTGCAAGC AGCAGATTAC 
CTTTGGCGGC CACCATCGCC ACCAAAAAAA CAAACGRCC TCGTCTAATG 



5601 CCGCAGXAAA AXACGATCTC AAGXACATCC TTTCATCTIT TCTACGGGCf 
CGCCTCTTTT TTTCCTACAO TTCTTCTAGG AAACTAGAAX AGATCCCCCA 



5551 CTCaCGCTCA CTGGAACGXA AACTCACGTT AAGGGATT7? CG7CATGAGA 
GAC7GCGAGT CACCTTGCTT TTCAGTGCAA TOCCCTAAAA CCACXACTCT 



C1701 TfATCAAAAA GGATCTTCAC CTAGATCCTT TTAAATTAAA AATGAAGTTT 
AATAGTTTTT CCTAGAAGTC GATCTXCGAA AXTTTAATTT TTAC77CAAA 



?5751 TAAACCAATC TAAACTATAT ATGAG7AAAC TTGGTCTGAC AGTTACCAAT 
AT7TACTTAG ATOTCATATA TACTCA~TTG AACCAGACTG TCAATGGTTA 



N*801 GCWAATCAG TGAGGCACCT ATCTCAGCGA SCTGTCTATT TCCTCCATCC 
Ql CCAATTAGTC ACTCCGTCCA 2AGAGTCGCT AGXCAGATAA AGCAAGTAGG 



^5S51 ATAGTTGCCT CACTCCCGSG GGGCCGGGCG CCGAGGTCTG CCTCCTGAAG 
L TATCAACGCA CTGAGGCCCC CCCCCCCCGC GACTCCAGAC GGAGCACTTC 



filSSOl AAGGTGTTGC MACTCATAC CAGGCCTGAA TCCCCCCATC ATCCAGCCAG 
iLii TICCXCAACG ACTGAGTATG GTCCGGACTT AGCGGCCTAG TACCTCGGTC 



•ESSSI AAAG?GAGGG AGCCXCOOTT GATGAGAGW TTGTTGTXGG TGGACCAGTT 
M 7TICACTCCC TCGGTGCCAA CTACTCTCGA AACAACATCC ACCTGGICAA 

€001 GGTGATTTTC AACTXWGCT ttgccacgga acggtctgcg ctctcgggaa 

CCACTAAAAC TTGAAAACGA AACGGTGCCT TCCCAGACGC AACAGCCCTT 



f 051 GATCCOTGAT CXGATCCMC AACTCAGCAA AAGTTCGATT TATT GAACAA 
C2ACGGACTA GACTAGGAAG TTGAGTCCTT TTCAAGCTAA ATAAGTTGTT 



C101 AGCCGCCGTC CCCTCXAGTC AGCGTAATGC TCTCCCAOTG TTAGAAGCAA 
gCGCCGQCAO CGCAGTTCAG TCGCATSACG ACACGGTCXC AATGTTCGTT 



€151 TTAACCAA7T CTGATTAGAA AAACTCATCG ACCAT CAAAT CAAAC7GCAA 
AASTGG2XAX GACTAWCTT TTXGAGTAGC TCOTAOTTTA CTTTGACGTT 



6201 TTTATTCATA TCAGGATTAT CAATACCATA OTTTTGAAAA AGCCGTTTCT 
AAA7AACTAT ACTCCTAATA OTTATGGTAT AAAAACTTTT TCCGCAAAGA 
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6251 CTAATCAAGG AGAAAACTCA CCOAGGCAGT TCCATAGGAT GGCAAGATCC 
CATTXCTTCC TCTTTTGAC7 GGCTCCGTCA ACCTATCCTA CCGTTC7AGC 



€301 TGGTA7CGG? CTCCOATTCC OACTCGTCCA ACATCAATAC AACCTATTAA 
ACCA7AGCCA GACGCTAAGO CTCACCAGGT TCTAGTTA7G TTGGATAATT 



(351 TTTCCCCTCC TCAAAAATAA GCTTATCAAG TCAGAAATCA CCATGACTGA 
AAAGGGCACC AGTTTTTATT CCAATACTTC ACTCTTTAGT CCTACTCACT 



HiAdlZZ 



€401 CGACKAATC COGTGACJULT CCCXAAAGCT 5ATGCATTTC TTTCCAGACT 
GC7CACTTAC GCCACTCTTA CCOTTTTCGA ATACGTAXAO AAAGGTCTGA 



€451 TGTTCAACAG GCCAGCCATT ACGCTCGTCA TCAAAATCAC TCGCA5CAAC 
ACAACTTTCTC cgstcggsaa TGCGAGCACT AG7TTTAGTG agcctagttc 



Pvui 

€501 CAAACCCCTA TTCATTCGTG AT7GCGCCTG AGCGAGACGA AATACGCGAT 
GTT7CGCAAT AAGTAAGCAC TAACCCflGAC TCCCTCTGCT TTATGCGCTA 



Pvui 

=€551 CCCTGTTAAA AGGACAATTA CAAACAGGAA TCGAATCCAA CCGCCGCAGG 
CCGACAATTT TCCTGTTAAT CTOTGTCCTT AGCTTACGTT CGCCGCGTCC 



^€01 AACACTCCCA GCGCATCAAC AATA 1VX TCA CCTGAATCAG CATATTCTTC 
O TTGTCACGGT CGCGTACTTG TTATAAAAGT GGACTTAGTC CTATAAGAAG 



3*51 TAATACCTOG AATGCTGTTT TCCCCGGGAT CGCAC-TGGTC AGTAACCATG 
J A77ATGGACC TTACGACAAA AGGGCCCCTA GCGTCACWC TCATTGGTAC 



8p701 CATCATCAGG AGTACGGATA AAATCCTTGA TCGTCGGAAG AGGCATAAAT 
GTAG7AGTCC TCATGCCTAT TTTACGAACT ACCAGCCTTC TCCCTATTTA 

r^751 TCCCTCAGCC AGTTTAGTCT GACCATCTCA TCTGTAACAT CATTCGCAAC 
AGGCASTCGG TCAAATCAGA CTGGTACAGT AGACATTGTA CTAACCCKG 



JSSOl CCTACCTTTG CCATG7TTCA GAAACAACTC TOGCGCACCG GGCTTCCCAT 
CGXTGGAAAC GGTACAAAGT CTTTGTTCAG ACCGCGTAGC CCGAAGGCTA 



nbsi ACAATCGATA GATTCTCGCA CCKATTCCC CGACATTATC CCGAGCCCAT 
TGTTACCTAT CTAACAGCCT GGACTAACGG GCTGTAATAG CGCTCCGGTA 



XhoZ 

€901 TTATACCCAT ATAAATCACC ATCCATGTTG CAATTTAATC OCGGCCTCGA 
AATATGGGTA TATTTAGTCG TAGQTACAAC CTTAAATTAG CGCCCGAGCT 



XhoZ 

€951 GCAAGACGTT TCCCGT-TCXX TATGGCTCAT AACACCCCTT CTATTACTOT 
CCTTCTGCAA AGGCCAACTT ATACCGAGTA TTGTGGWXA CA?AATCACA 



7001 TTATGTAAGC AGACAGTW? ATTGTTCATC ATGATASAXT TTTATCOTGT 
AATACATTCG TCTGTCAAAA TAACAACTAC TACTATATAA AAATAGAACA 
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DraXXI 



7051 CCAXTGTAAC ATCAOAGATT T7GAOACACA ACGTOOCTXT CCCCCOCCCC 
CCTXACATTG TAGTCTCTAA AACTCXCTC? TOCACCGAAA GCGGGGGGGG 



7101 CCATTATTOA ACCATTTATC AGGCT7AITG rCTCATGAGC GGATACATAT 
GGCAA2AACT TCCTAAATAC TCCGAAXAAC AfiAGTACTCC CCTATG7ATA 



7151 WGWOTTXT TTAGAAAAAT AAACAAATAG GCGT7CC0CC CACATTTCCC 
AACTTACATA AATCTTTTTA TTTGTTTATC CCCAXCGCGC GTGTAAAGOO 



7301 CCAAAASTGC CACCMACGT CTAAGAAACC ATTATTATCA TCACATTAAC 
GCTT7TCACG CTGOACTGCA CXWCTTTGG TAATAATAlG? ACT6TAAT7C 



7251 CTATAAAAAT AGGCCTATCA CGAGGCCCTT TCCTC 
GAXATTTTTA TCCGCATACT CCTCCGGGAA AGCA0 
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X TCCCCCOTTT CGMGKPGAC GGTGJUUUCC TCTCAGACAT GCAGCTCCCG 
ACCGCGCAAA GOCaCTACTG OCACTTTTGG AOACTCTGTA CCTCOACCCC 



51 GACACGCTCA CA CC TTGTCT QTAAGCGCAT CCCOCOCCA GACAAOCCCG 
CTCTGCCA0T GTCGAACAGA CATTCGCCTA CGGCCCTCGT CTC7TCGGCC 



101 TCAGGGCGCG TCA6CCG0T6 TTGGCGGGM TCGGCCCTGG CTTAACTATG 
AGTCCCCCCC AGTCGCCCAC AACCGCCCAC ACCCCCGACC GAATTGXTAC 



Ndcl 



IS1 CGCCATCACA GCAGATTGTA CTGAGAGTGC ACCXTATGCG GTGTGAAA7A 
CCCGTACTCT CGTCTAACA? GACTCTCACG TGCTATACGC CACACTTTAT 



201 CCGCACAGAT GCCTAAGGAC AAAATACCGC ATCAOATTGG CTAT7GCCCA 
GGCCTGTCTA CGC A TTC C TC TTTTATGGCG TAOTCTAXCC GATAACC66T 



251 TTGCATACGT TCTATCCATA TCATAATATO TXCATOATA T7GGCTCA7Q 
AACGTATGCA ACATAGG7A7 ACTATTATAC ATGTAAATAT AACCGAGTAC 



301 TCCAACATTA CCGCCATQTT GACATTGATT ATTGACTAGT TATTAATAGT 
ACCTTSTAA? GGCCCTACAA CTG7AACTXA TAACTCATCA ATAATTATCA 



r1351 AATCAATTAC CCGGTCATTA GTTCATAGCC CXTATATGGA G7TCCGCC7? 

TTAC7TAATG CCCCACTAAT CAAGTATCGG CTATATACCT CAAGGCGCAA 



y.!<01 ACATAACTTA CGG7AAATGG CCCCCCTGGC TGACCGCCCA ACGACCCCCO 
TGTATTGAA? GCCATTTACC CGGCGGACCC ACTGGCCCGT 7GCTGGGGGC 



CCCA7TGACG TCAATAATGA CGTATGTTCC CATAGTAACG CCAATAGGGA 
GCGTAACTCC AGTTATTACT CCATACAAGG GTATCA7TGC CCTTATCCC? 



y $01 CTTTCCATTG ACGTCAATGG CTCGAG7ATT TACGGTAAAC TGCCCACTTC 
CAAAGGTAAC TGCAGTTACC CACCTCATAA ATGCCATTW ACGGGTGAAC 



J 551 GCAGTACATC AAGTCTATCA TATGCCAAGT ACGCCCCCTA TTGACGTCAA 
CGTCATGTAG TTCACATAC? ATACCCTTCA TCCGGGGGAT AACTGCAGTT 



: eOl TGACGG7AAA TGGCCCGCCT GGCATTATGC GCAGTACATC ACCTTATGGG 
ACTGCCATTT ACCGGGCGGA CCCTAATACG GGTCAXGTAC TCGAATACCC 



Wool 

€51 ACTTTCCTAC ttggcactac atctacgtat tagtcatcgc tattaccatg 

TCAAACCAXG AACCGTCATC TAGATCCATA ATCAGTAGCG ATAATGCTAC 



HCOI 

701 07GATCCGGT TTTGGCAGTA CATCAATGGG CCTCGATAGC CGTTTGAC5C 
CACTACGCCA AAACCCTCAT C7AGTTACCC GCACCTATCG CCAAACTGAG 



751 ACGGGGATTT CCAAGTCTOC A CCCCA TTCA CGTCAATCGG AG7TTCTTTT 
TGCCCCTAAA GGTTCAGAGG TGGGGTMCT GCAGTTACCC TCAAACAAAA 



tOl GGCACCAAAA TCAACGGGAC TTTCCAAAAT CTCGTAACAA CTCCGCCCCA 
CCCTGGTTTf AGTTGCCCTG AAAGGTC7TA CAGCATTCTT GAGGCGGGGT 
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ssi Trcx e scAM. tgggccccag ccctgtacgg tgggaggtct A7A7aagcaq 

AAC7GCGT77 ACCCGCCATC CGGACAXCCC ACCC7CCAGA TATA77CCTC 



»01 ACCTCCTTTA G7CAACCGTC AQATCCCCTO GAGACGCCA7 CCACG CIO T T 
7CGAGCAAA7 CACTTGGCAG TCTAGCGGAC C7C7GCGGTA GOTGCCACAA 



sacli 

151 7TGACC7CCA 7AGAAGACAC CGGGACCGAT CCAGCCTCCC CGGCCGGGAA 
AACTGGAGGT A7CTTCTGTG GCCCTGGCTA CG7CGGAGGC GCCC5CCC77 



1001 CGGTGCAMO CAACGCGCAT TCCCCGTGCC AAGAGTGACC 7AAG7ACCGC 
CCCAC6TAAC CTTCC CCC TA AGGGGCACGG TTCTCACTGC A77CAXGGCG 



SphX 

1051 C7A7AGAC7C TA7ACCCACA CCCC7TTGGC 7CT7A7CCA7 GCTATACTCT 
GATATC7CAG A7ATCCOTGT GGGGAAACCG AGAATACGTA CGA7ATGACA 



1101 TXTTCGCTTG GGGCCTATAC ACCCCCGC7T CCTTATGCTA TAGC7GA7GG 
AAAACCGAAC CCCGSATATG TGGGGGCGAA GGAA7ACGAT ATCCACTACC 



1151 TA7AGC7TAC CCTATACGTG 7GCG7TAT7C ACCATTATTG ACCACTCCCC 
A7A7CGAATC GGA7ATCCAC ACCCAA7AAC 7GG7AA7AAC 7GGTGAGGGG 



t rtt01 TATTGG7CAC GATACTTTCC ATTAC7AA7C CA7AACA7GG CTCTTTCCCA 
I;: A7AACCACTG CTATGAAAGG TAATGATTAG C7A77G7ACC GAGAAACGG7 



Gi2Sl CAAC7A7C7C 7A77GGC7A7 A7GCCAA7AC 7C7CTCCTXC AGAGAC7GAC 
GT7GA7AGAG A7AACCGA7A 7ACCG77A7C AGACAGGAAG 7C7C7GAC7G 



a430l ACGGAC7C7G 7A77TTTACA GGA7GCCG7C CCA7T7A77A TTTACAAAT7 
J=_" 7GCC7GAGAC A7AAAAA7G7 CC7ACCCCAG GG7AAA7XA? AAATGTTTAA 



- 1351 CACATATACA ACAACGCCGT CCCCCG7GCC CGCAG77TTT A77AAACATA 
O CTGTATA7GT TGTTGCGGCA CGGGGCXCGG GCGTCAAAAA 7AATT7GTA7 



G liOl GCG7GGGATC TCCACGCGAA 7C7CGGC7AC G7C77CCGGA CATGGSCTCT 
I CGCACCC7AG AGG7GCGCTT AGAGCCCA7G CACAAGGCCT G7ACCCGAGA 



Ol451 7CTCCGG7AG CGGCGGAGCT TCCACATCCG AGCCC7GG7C CCATGCC7CC 
AGAGGCCATC CCCGCCMGA AGCTGTAGGC TCGGGACCAG GGTACGGAGG 



1501 AGCGGC7CA7 GG7CGCTCOG CAGC7CC77G C7CC7AACA0 7GGAGGCCAG 
7CCCCGAG7A CCAGCGAGCC CTCGAGGAAC OAGGA77G7C ACCTCCGGZC 



1551 AC7TAGCCAC AGCACAA7GC CCACCACCAC CaOTOT GC CC CAGAACCCC O 
7GAA7CCG7C TCG7GTTACG GCTGGTGGTC GTGACACGOC G7GTTCCGCC 



1601 7CGCGG7AGG GTATGTG7CT GAAAA7GAGC G7GGAGA7TG GGC7CGCACG 
ACCGCCATCC CATACACACA CTTTTACTCG CACCTMAAC CCGAGCG7GC 



1S51 GC7GACGCAC A7GGAXGXC7 7AAGGCAGCG GCAGAAGAAG ATCCAGGCAG 
CGACTQCGTC TACCTTC7GA ATTCCGTCGC CG7CTTC77C 7ACG7CCGTC 



1701 CTGAG77C7T C7ATTC7GAT AAGAGTCAGA GG7AACTCCC C7TGCGGTCC 
GAC7CAACAA CA7AAGACTA TTCTCAGTCT CCATTGAGGG CAACGCCACG 



-37- 



WO 99/37331 



PCT/US99/01382 



npftX 

1751 TCTTAACCCT ggaccgcag? ctagtctgxg cactactcc? tgctgcccco 

ACAMTXCCCA CCTCCCCTCX CATCAGACTC CTCATGAGCA ACGACCOCGC 



Ncol 

1801 CCCCCC^CCJl GACATAA3CAG CTGACAGAC? AACAGACTG7 TCCTTTCCXT 
CCCCGCTCCT CCC7ATTATC GACTGTCTGA TTCTCTGACA AGGAAAGQTA 



Scol mil EcORVSotI 

1B51 QOCTCTTTXC TGCACTCACC CTCOTCGACA CGTCTGAICA CATATCGCCG 
CCGAGAXAAG ACCTCAG7GC CAGCACCTGT CCACACTAGT CTATAGCGCC 



Uotl Xb*X 

1901 CCCC7CTAGX CCAGGCGCCT GCATCGAXT? GATGAACAT7 AACCCGACAG 

ggcgmatct ggtccccgga ccTAccrrAx ctacttc ta a ttcggctgtc 



1951 TCACCGTAXT CTTCATCTCT CTTAGATTAT TTGTTTTCCA GAGTAGGGGT 
ACTCCCA7TA GAAGXAGAGA GAATCTAAXA AACAAAAGGT CTCA7CCCCA 



2001 CCTCASCTCC TTTCCAATCG TGTAACCAAA ATAAACTCCA CTAGAAGCXT 
r=l CCAGTCCAGG AAAAGTTAGC ACATTGGTTT TATTTGAG3? CA7CTTCCTA 



1$S1 AtTGTGGCGC AACAACACAA TCGGCCWAC AGGAATATTG CAGTTACCTC 
TAACACCCCG TTGTTGTGTT ACCCGCAATG TCCTTATAAC GTCAATGCAG 



1101 CTGATCGATT CAAGAGGACA TCATTCTXTC TTTGGGTAAT TATCCTTTTC 
CACTAGCTAA GTTCTCCTCT AGIAAGAAAG AAACCCATTA ATAGGAAAAG 



MiSl CAAAGAACAT TTTCCATCCC ACTTGGAGTC ATCCACAATA GCACATOACA 
01 GTCTCTTGTA AAAGGTACGG TGAACCCCAG TAGGTCTTAT CGTCTAATGT 



«01 GGTTAGTGA7 CTCGACAAAG TACTTXGTCG TGACAAACTG TCATCCACAA 
r£ CCAATCACTA CAGCTCTTTG ATCAAACAGC AC7GTTTGAC AGTAGCTCTT 



fffSl A7CAATTGAC ATCAGTTGGA CTGAATCCCG AAGGGAATGG AC7GGCAACT 
J5 TAGTTAACTC TACTCAACC? GACTTAGAGC TTCCCTTACC TCACCCTTGA 



2301 CJSjCGTGCCAT CTGCAACTAA AAGATGGGGC TTCAGGTCCG GTGTCCCACC 
r " C7GCACGGTA GACGTTGATT TTCTACCCCG AAGTCCAGGC CACAGGGTGG 



2351 AAAGGTGGTC AATTATGAAG CTGOTGAATO GGCTCAAAAC TCCT ACAAT C 
TTTCCACCAG TTAATACTTC GACCACTTAC CCGACTTTTG ACGA7CTTAG 



2«1 TTCAAATCAA AAAACCTGAC GGGAGTGAGT OTCTACCAQC AGCGCCAGAC 
AACTTXAGTT TTWGGAC7G CCCTCACTCA CAGATCOTCG TCGCGGTCTG 



2451 CCGJC7CGGG GCTTCCCCCC CTGCCCCTAT GTGCACAAAG TATCAGGAXC 
CCWAAGCCC CGAAGGCCGC CACCGCCATA CACGTGTTTC MAGTCCTTG 



J501 CGCACCCTCT GCCGGAGACT TTGCCTTCCA TAAAGAGGGT GCTTTCTTCC 
CCCTG5CACA CCGCCTCTGA AACGGAAGGT ATTtCTCCCA CCAAAGAASG 



2551 TGTA7GATCG ACKGCTTCC ACAGTTATCT ACCCAGGXAC GWTmCGCT 
ACMACTAGC TGAACGAXGG TGTCXATAGA TGGCTCCTCG CTGAAXGCGA 
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2601 CJUGGTCTCG TTGCATTTCT GATACTGCCC CAAGC7AAGA AGGACTTCTT 
CTTCCAC A S C AACG7AAAGA CTATGACGGG GTTCGATTCT TCCTGAAGAA 



2651 CAGCTCACAC CCCTTCAGAG ACCCGGTCAA TGCAACCGAC CACCCCTCTA 
GTCCAGTCTC GGGAACTCTC TCGGCCAflTT ACGTTGCCTC CTGGGCAOAT 



EcoRV 



2701 GTGGCTACTA 7TCTACCACA ATTAGATATC AGGCTACCGG TTTTGGAACC 
CACCGATGAT AAGATGGTGT TAATCTATAG TCCCATGGCC AAAACCTTOO 



2751 AAMACACAG ACTACTTCTT CGAGGTTGAC AATTXCACCT ACCTCCAXCT 
TTACTCTGTC TCATCAACAA CCTCCAACTG 27AAJLC7GGA TGCACCTTGA 



2B01 TGAATCAAGA TTCACACCAC AOTTTC7CCT CCAGCTOAA? CAGACAXTAT 
ACTTA CCT C T AACTGTCCTG TCAAAGACGA CGTCCACTTA CTCTGTTATA 



2851 ATACAAGTGG GAAAAGGACC AA7ACCACGC GXAAACTAAT TTCGAAGGJC 
SATCTTCACC CTTTTCCTCG TTATGGTGCC CTXTTGATTA AACCTTCCAO 



2901 AACCCCCAAA 7TGATACAAC AATCGGG3AG TGGGCCTTCT GGGAAACTAA 
TTGGCGCTTT AACTATCTTG TTAGCCCCTC ACCCGGAAGA CCCT7TGATT 



-fSSl AAAAACCTCA CTAGAAAAAT TCCCAGTC5AA GAGTTCTCTO TCACAGTTCT 

rrrrrcGAGT gatcttttta accgtcactt ctcaacagaa agtgtcaaca 



y|001 ACCAAACGGA GCCAAAAACA TCAGTCCTCA GAGTCCGGCC CGAACT7CTT 
O TAGTTXGCCT CGCTTTTTC? AGTCACCAGT CTCAGGCCGC GCTTGAAGAA 



rS051 CCGACCCACG GACCAACACA ACAACTGAAC ACCACAXAAT CATGCCTTCA 
J! CGCTGGCTCC CTGGTTGTGT TGTTGACTTC CGGTCCTTTA CTACCGAACT 



OIlOl GAAAATXC? CTGCAATGGT TCAAGTGCAC AC5TCAAGGAA GGGAAGCTGC 
CrTTTAAGGA GACGTTACCA ACTTCACGTG TCAGTrCCP? CCCT3CGACG 



ISjlSl AGTGTCCCAT CTAACAACCC TXCCCACAAT CTCCACGAGT CCCCAATCCC 
TCACAGCGTA GATTCTTGGG AACGGTCTTA GACGTGCTCA GGGGTTAGGG 

X32Q1 TCACAACCAA ACCAGGTCCG GACAACAGCA CCCATAATAC ACCCCTGTAT 
X AGTOTTGGTT TGGTCCAGGC CTGTTCTCGT CGGTATTATG TGGCCACATA 



^3251 AAACTGACX TCTCTGAGGC AACTCAAGTT GAACAACATC ACCCCAGAAC 
TTTGAACTGT ACAGACTCCG TTGAGTTCAA CTTGTTGTAG TCCCGTCTTG 



2301 AGACAACCXC ACCACAOCCT CCGACACTCC CTCTGCCXCG ACCCCAGCCC 
TCTCTTCCTG 7CG7CTCGGA GGCTGTGAGG GAGACGGTGC TGGCG7CGGC 



3351 GACCCCCAAA AGCAGAGAAC ACCAACACOA CCAAGAGCAC TCACTTCCTC 
C7GGCGGTTT TCGTCTCTTG TGGTTGTGCT CG7TCTCGTG AC3G&AGGAC 



3401 GACCCCGCCA CCACAACAAG TOCCCAAAAC CACAG C GAGA CCGCTGGCAA 
CTGGGGCGGT CCT G TT G TT C AGGGGTTTTC GTGTCCCTCT CGCQACCGTT 



3451 CAACAXCACT CATCACCAAG ACACCGGACA AGAGASTGCC AGCAGCCGGA 
GTTOTTGTGA CTAGTCOTTC TATGGCCTCT TCTCTCACCC TCGTCCCOCT 
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3501 AGCTACGCTT AATTX^CAAT ACTATTGCTG GAGTCGCAGG ACTCATCACA 

tcsiciccgUA mjlai^tol toatjucgac ctcagcgtcc tcactactct 



3551 CGCCGGNSAA GAACTCGAAG AGAACCAATT CTCAATCCTC AACC CAAAT C 
CCCCCCTCTT CTTGAGCTTC TCTTCGTTAA CAGTTACOAG TTCGGTTTAC 



3601 CAACCCTAAT TTAGATTACT GOACTACTCA CCATGAACGT GCTCCAATCG 
CTPOGQCTTA AATGTAATGA CCTCaTGACT CCTAjOTCCA CCACGTTACC 



3651 GAOTCCCTG CaTACCaTAT TTCGGGCCAG CAGCCGAGGG AATTTACaTA 
CTCACCCGXC CTATCOTATA AAGCCCGGTC MCGGCTCCC T7AAATGTAT 



3701 GAGCGGCTAA TGCACAATCA AGATGOTTTA MC5GTGCCT TGAGACAGCT 
CT CCCC CA T T ACCTGTT A CT TCTACCAAAT TAGACAC CC A ACTCTGTCGA 



3751 GCCCAACGAG accactcaag ctcttcaact cttcctgaga cccacaacw 

CCGGZrXGCTC TCCTGACTTC GaGAAGTTGA CAAGGaCTCT CCGTGTTGAC 



3801 AGCIAGGCAC CTCTTCAATC CTCAaCCGTA aCGCAATTCa tttctkctg 
TCGASGCG7C CAAAACTTAG GAGTTGGCAT TCCCTTAACT AAAGAACCAC 



3851 CAGCCATGGC GCGGCACATG CCaCATTCM GGACCGGACT GCTGTATCGa 
GTCCCIACCC CGCCGTOTAC CG7G7AAGAC CCTCGCCTCA CGACATAGCT 

l 3901 ACCACATGAT TGGACCAXGA ACATAACAGA CAAAATTGAT CAGATTATTC 
S TG 3 TST A CTA AC C WCT C T TC7ATTGTCT GT ' IT TaACTA CTCTAATAAG 



•13951 ATGA3TTTGT TGATAAAACC CTTCCGGACC AGGGGGACAA TGACAAXTGC 
O TACTAAAACA ACTATTTTGG GAAGGCCXCG TCCCCCTGTT ACTGTTAACC 



man tgg&cagga? cgagacaatg gataccggca ggtattggag ttacaggcct 

~1 ACCTGTCCTA CCTCTGCTAC CTATCCCCG7 CCATAACCTC AATC7CCGGA 



55 4051 TATAATTGCA GMaTCGOT TATTCTGTAT ATGCAAATTT GTCTTTTACT 

O ATA3TAACGT CAATAGCGAA ATAAGACATA TACGTTTAAA CAGAAAXTCA 

JfiilOi TTTTCTTCAG ATTGCTTCAT GGAaAAGCTC AGCCTCaAAT CXA7GAAACC 

r ™ AAAAGAAGTC TAACGXAGTA CCTTTTCCXG TCCCAGTTTA GTTACTCTGG 



OtlSl AGGATTTAAT 7ATATGGATT ACTXGAATCT AAGATTACTT GACAAATGAT 
Ui TCCTAAATTA ATATACCTAA TGAACRAGA TTCTAATGAA CTCTTTACTA 



6201 AATATAaTAC AC2GGACC7? TAAACATAGC CAATGTGATT CTAACTCCTT 
TXATATTATG TGACCTCOAA AffTTGTATCG GTTACACTAA GAXTCAGGAA 



4251 TAAACTCACA GTTAATCATA AACAAGGTTT CCAATTCATC TOCTGTGCCT 
AmCASIOT CAATTAGTAT Tf G TT C CAAA CCTtAACTAG ACGACACGOA 



4201 TCTAGTTGCC AGCCATCTGT TCTTTOCCCC TCCCCCOTGC CTTCCTXCaC 
AGAICAACCC TCGGtAGACA ACAAACGGGG AGGCCGCA CC GAAGGAACTG 



4351 CCTGGAAGGT GCCACTCCCA CVUTUUlT g C CTAATAAAAT GAGGAAATTG 
COACCTTCCA CCCTGAGCCT GACAGGAAAG GATTAXTTTA CTCCTTTAAC 



4401 CaTCCCATTG TCTGAGTAGG TGTCATOCTA ctctwccoo TGGCGTGGGG 
GTACCGTAAC AGACTCX7CC ACAGTAXCAT AAGaCCC CC C ACCCCACCCC 



-40- 



WO 99/37331 



PCT/US99/01382 



SphX 

4151 CACCACAGCX AGGGCGAGOA TTGGOAAGAC AATAGGAGOC ATGCTGGGCA 
OTCCtCTCOT TCCCCCTCCT AACCCTTCTG TTATCGTCCfl TACGACCCCT 



Kpnl 

4501 Toccerccac tctatcggta cccacctgct cxagaxttca cccggttcct 

ACGCCACCCG ACATACCCA? GGGTCCACGA CTTCTTXACT GCCCCAAGG A 

4551 CCW3GGCCAG AAAGAAGCAS GCACATCCCC TTCTCTGTGA CACACCCTGT 
CGXCCCGGTC TTTC1TCCTC CGTCTAGGCG AAGAGACACT GTGTCGGACA 

4*01 CC A C Q CCCC T GGTTCTTJLGT TCCXGCCCCA CTCATACGXC ACTCATAQCT 
CCTOCCCCGX CCAAGAATCX AGGTCCCGGT GAGTA2CCTC TGAG7ATCGA 

4651 CAGGACCGCT CCGCCTTCAA TCCCACCCGC TAAAGTACT? OGAGCCOTCT 
GICCTCCCOA GGCGGAAQTT AGGOTOGGCG XTTTCXTGAX CCTCGCCAGA 

4701 CTCCCTCCCT CATCXGCCCA CCAAXCCAAA CCTAGCCTCC AAGAG7GGGA 
GASSCAGGGA GTAGTCGGGT CCTTTGGTTT GGAXCCGAGG TTCTCACCCT 

4751 AGAAXTTAAA CCAAGATAGG CTXWXAGTC CA6AGCCA6A 6AAXXT GCCT 
TCrTTXMTT CGTTCTXTCC CXTAATTCXC GTCTCCCTCT CTTTTACGGA 

'%"Bfii CCXXCATCTC AGGAACTXXT CXGAGAAXTC ATACAATTTC TTCCGMTCC 

w COTCTXCAC TCCTTCAWA ctctctttag tatcttaaxg axgccgaagg 

?iB51 TCCCTCXCTG ACTCGCTGCG C7CGCTCGTT CGGCTGCCGC GXCCGG7XTC 
G XGCGAG7GAC TGAGCGACGC GACCCAGCAA GCCGXCCCCG CTCGCCXTXG 
J?. ..•••.-•*•••••-•--•••-•••••-••••••• 

SO90I AGCTCACTCA AAGGCGGXAA TACGCTTATC CXCXGAATCX GGGGATAACC 
01 TCG&STGXGT TTCCGCCAW XTCCCXXTAC GTGTCMXGT CCCCTAWGC 

^$51 CASGAAAGAA CATGTGUGCA AXXGGCCACC AAAAG CCCAG CAACCGTAAA 
Jlf G7CCTTTCTT GTACACTCG7 TTTCCGGTCG TTTTCCGGTC CTTCCCXTTT 

^^5001 XXGGCCGCG7 TGCTCGCGW TTOCCATAGG CTCCGCCCCC CTGXCGXGCX 
> TTCCCCCGCX XCGXCCCCXX XAAGG7ATCC GAGGCGGGCG GACTGCTCGT 

H5QSI TCACAAAAAT CGXCGCTCXX GTCA6AGCTG ^CJXXKCG XCXGGXCTM 
r* s XGTCTTCT7X GCTCCOXGTT CAGSCTCCAC CCCTttCCGC TCTCCTGXTX 

Slfil XAAGATACCA GGCCTTTCCC CCTGGXXGCT CCCTCGTGCG CTCTCCTCTT 
MTOTAKOT CCCCXXXCGG GGACCTTCCA QQGACCACGC CAOAGCXCAA 

5151 CCCXCCCIGC CGCTTACCGG ATACCTCTCC OCCTTTCTCC CTTCGGGAA& 
COTCCGACO CCGXAWCCC TATGGfcCAGG CGGAAAGAGG GAAGCCCTTC 

5201 CCTGGCGCTT TCTCAATGCT CXCGCTCTXG GTATCTCACT TCGOTCTAGG 
CCACCGCGAA AGAGTTACGA GTGCGACATC CATAGACTCA ACCCACATCC 

5251 * "tcgttcgctc cxagctcggc tctgtocacg aacccoccgt tcaqccccac 
ajxaxbcgag ottcgacccg acacacgtgc tpgcggggca agtccggctg 

5301 ccctgcgcct tatccggtax ctatcotctt cagtccaacc cgctaagaca 
gcgacccgga ataggccatt gataccagaa ctcaggwgg gccattctc? 
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53S1 CCAC TTXTCQ CCACTOGCAfl CA6CCACT06 TAACAQGATT AG CAGAG CCA 
GC7GAATAGC GCTOACCCTC GTCGGTOACC ATOGTCC2AA TCCTCTCGCT 



5401 GGXaXOTAGa CGCTQCTACA GAGTTCTTGA ACTGOTGGCC TAACTACGOC 
CCAtACATCC COCACGATGT CTCAAGAAC? TCACCACCGG ATTGATQCCO 



5451 TACACTAGAA GCACAGTATT TCGTATCTOC CCTCTGCTCA AGCCACTTAC 
ATCXCAICK CCTGTCXTAJL ACCATAGACC CGAOACCACT TCCCTCAATC 



5501 CTTCGGAAAA AGAGTTGGTA CCTCT7CATC CGGCAAACAA ACCACCCCTO 
CAAGCCTTTT TCTCAACCAT CGACAACTAG GCCGTTTCTT TGGTCCCGAC 



5551 CXAGCGCTGG mTmvn' TCCAACCAOC ACAITACCCG CAGAAAAAAA 
CATCCCCACC AAAAAAACAA ACCTTCGTCC TCTAATCCGC GTCTTTTTTT 



5601 GGA7CTCAAG AACATCCTTT GMCTTTTCT ACCGG G T C TG ACGCTCAGTC 
CCTAGAG7CC TTCTAGGAAA C3AGAAAAGA TGCC CC AfiAC TGCGAGTCAC 



5(51 GAACGAAAAC TCACGTTAAG GGATTTTCGT CATGAGATTA TCAAAAAGGA 
C-itWHTlW AOTGCAATTC CCTAAAACCA GTACTCTAAT ACTTTTTCCT 



5701 TCT3CACCTA CATCCCTCTA AATTAAAAAT GAACT7TTAA ATCAAXCTAA 
AGAAGTGGAT CTACGAAAAT CTAATTTTTA CTTCAAAATT TAGTTAGATT 



57S1 AjCSTATATATG agtaaacttg ctctgacagt taccaatgct taatcagtga 

TCATATATAC TCATTTGAAC CAGACTCTCA ATGGTTACGA ATTAGTCAC? 



tfgoi CCCACCTATC TCAGCGATCT GTCTATTTCG TTCATCCATA GTTGCCTGAC 
n CCGJCGATAG AGTCGCTAGA CAGATAAAGC AAGTAGGTAT CAACGGACTG 

?! 851 TCCCSGGGGC GGGCCCGCTC AGGTCTGCCT CCTGAAGAAG GTGTTGCTGA 
AGGCCCCCCC CCCCCGCGAC TCCAGACGGA OCA C TTCTT C CACAACGACT 



05901 C2CASACCAG CCC7GAA7CG CCCCATCATC CACCCAGAAA GTGAGGGAGC 
GAGTATGGTC CGGACTTAGC GGGGTAGTAG GTCGCTCTTT CACTCCCTCG 



4S51 CACCGTTGAT GAGAGCTTTC TTGTAGGTGC ACCAGTTGG7 CA7TTTCAAC 
10 GTCCCAACTA C1CTCGAAAC AACATCCACC TGCTCAACCA CTAAAACTTC 



r$001 T TTT G CTCTG CCACGGAACG GICTGCGTTG TCGGGAAGA? GCGTGATCTG 
ZT AAAACGAAAC GGTGCCTTGC CAGACGCAAC AGCCCTTCTA CCCACXAGAC 



HgOSl ATCCtTCAAC TCAGCAAAAG TTCCATTTAT TCAACAAAGC CGCCCTCCCG 
CASGAAGTTG AOTCGTTTTC AAGCTAAATA AGMGTWCG GCGCCAGGGC 



C101 TCAACTCAGC GTAATCCTC7 CCCAGTCTTA CAACCAATTA ACCAATTCTC 
ACS2CAGTCG CATTACGAGA CGGTCACAAT GTTGCTTAAT TGGTTAAGAC 



(151 ATTACAAAAA CTCATCGAGC ATCAAATGAA ACTGCAATTT ATTCATATCA 
TAATCTTTTT CAGTAGCTCG TAG7T7AC7T TGACCTTAAA TAAGTATAGT 



$201 GCAT7A7CAA TACCATATTT TTGAAAAAGC CGTTTCTGTA AffGAAGCAOA 
CC2AATAGTT ATGGTATAAA AACTZTT9CQ GCAAAGACAT TACTTCCTCT 



«51 AAACTCACCG AGGCAQTTCC ATAGGATGGC AAGATCCTGG TATCGGTCTG 
TTTGACTGGC TCCGTCAAGG TATCCTACCG TTCTAGGACC ATAGCCAOAC 



-42- 



WO 99/37331 



PCT/US99/01382 



6301 CQATT C CCA C TCGTCCAACA 9GAATACAAC CTA7TAATTT CCC C T CG TCA 
GCTAAGGCTG AflC A G GTXC T AOTTATGTTO GATAATTAAA GGCCAGCACT 



€3S1 j^J^AOg y TAT CAAG TCA OAAATCACCA TGAGTCACOA CTCAATCCQG 
TTTTATTCGA ATAGWCACT CTTTAGTCCT ACTCACTGCT GACTTACC CC 



Hi nam 

6401 TGACAATGGC AAAAGCTTAT OCATTXCTTT CCACACTT G T TCAACAGGCC 
ACTCWACCC TTTTCGAATA CGTAAAQAAA GCTCTGAACA AGTTGTCCCG 



€451 AGCCATIACG CTCOTCATCA AAAXCACTCG CATCAACCAA ACCGTTATTC 
TCCCTAATGC GACCACTACT TTTAGTGAGC CTAOTTGCTT TGCCAATAAG 



PVUl 



€501 A77CGTGATT OCGCCTGAGC CACACGAAAT ACGCGATCCC TGTTAAAAGO 
TAAGCACTAA CCCGGACTCG CTCTOCTTTA TOCCCSAGCG ACAATTTTCC 



€551 ACAATTACAA ACACGAATCG AATGCAACCG GCGCACCAAC ACTCCCAGCG 
TG77AAMTT TCTCCTTACC TTACG7TGGC CGCCTCCTTC TGACGGTCCC 



6601 CATCAACAAT ATTTTCACCT CAATCAGCAT ATTCTTCTAA TACCTGGAAT 
GTACrrCTTA 7AAAAGTGGA CTTAGTCCTA TAXGAAGA7T ATGGACCMA 



£ftl GCTGTTTTCC CGGGGATCCC AGTGCTGAGT AACCATGCAT CA7CAGGAQT 
=1:1 CGACAfcAAGC GCCCCTAGCG TCACCACTCA 77GGTACGTA GTAG7CCTCA 



&01 ACGSATAAAA TGC77GATGG TCGGAAGAGG CATAAATTOC GTCAGCCAGT 
5? TCCCTATTTT ACCAACTACC AGCCTTCTCC GTATTTAAGG CAGTCGGTCA 



tMl TTACTCTGAC CATCTCATCT GTAACATCAT TGGCAXCGCT ACCTTTGCCA 
fll AATCAGACTG GTACAGTAGA CATTGTAGTA ACCCTTGC6A TGGAAACGG7 



1*01 TCTTTCAGAA ACAACTCTGG CGCA7CCCGC TTCCCATACA ATCGATAGAT 
AGAAASTCTT TGTTGAGACC GCGTAGCCCG AAGGGTAWT TACCTATCTA 

il51 TCTCSCACCT GATCGCCCGA CATTATCGCG AGCCCATTTA TACCCACXTA 
U. ACAGCGTCCA CTAACGGGCT G7AATAGCGC TCCGGTAAAT ATGGGTATAX 



mOl AATCAGCATC CATGTTGGAA TTTAATCGCG GCCTCGAGCX AGACGTTTCC 
tTAGTCCTAG CTACAACCW AAAMAGCGC CGGAGCTCGT TCTGCAAAGG 



6951 CGTTCAATAT CGCTCATAAC ACCCCTT9TA TTACTGCTTA TGTAAGCAGA 
CCAACTTASA CCCAOTATTC TGGGGAACA3? AATCACAAAT ACATTCGTOT 



7001 CA G TTT TA TT CTTCATGAM ATATATTTTT ATCTTCTOCA ATG7AACATC 
GTCAAAATAA CAAGTACTAC SATAXAAAAA TAOAACACGT TACATTGTAG 



7051 AGACASTTPG AGACACAACG TCGCTTTCCC CCCCCCCCCA 1TATTGAACC 
TCTCIAAAAC TCTCTGTTGC ACCGAAACGG GGGGGGGGGX AATAACTCCG 



7101 ATTTATCAGG GTTATTGTCT CASGAGCGGA TACATATTTG AATQTATTTA 
TAAAXAGTCC CAATAACAGA GTACTCCCCT ATGTAIAAAC TTACATAAAT 
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7151 C*****^AAA CAAATAaOQG T7CCGCOCAC ATTTCCCC6A AAAG7GCCAC 
CWH'IMW OTTTATCCCC kXGGCGCGTC SAAASSGGCT TTTCXCQOTO 

7201 CTOACCTCTA AOAAACCAT? ATTATCAMX CARAACCTA TAAAAATAGO 
«CTCCAfiAT TCTTTCOTAA TAATAOTACT GTAATTCGAT ATTTTTATCC 



1251 CCTATCACGA C fiC C C P ltCU TC 
C3CATAGTCCT CCOGGAAAGC AG 



-44- 



INTERNATIONAL SEARCH REPORT 



International application No. 
PCT/US99/01382 



A. CLASSIFICATION OF SUBJECT MATTER 

IPC(6) :A6IK 48/00; C07H 21/04; C12N 15/63, 15/86, 5/10, 15/40 

US CL :Picase See Extra Sheet 
According to International Patent Classification (IPC) or to both national classifica tion and IPC 
a FIELDS SEARCHED 

Minimum documentation searched (classification system followed by classification symbols) 
U.S. ; 424/450. 93.2. 93.21; 536/23.1, 23.72; 435/5, 6, 455, 457, 458, 320.1 

Documentation searched other than minimum documentation to the extent that such documents are included in the fields searched 



Electronic data base consulted dunng the international search (name of data base and, where practicable, search terms used) 
APS, Dialog, Biosis, Medline, Biotech 

Search terms: Ebola virus, glycoprotein, transmembrane, targeting 



DOCUMENTS CONSIDERED TO BE RELEVANT 



Category* 



Citation of document, with indication, where appropriate, of the relevant passages 



Relevant to claim No. 



US 5,166,320 A (WU et al.) 24 November 1992, columns 9-10 and 
claims 1-18, 

SANCHEZ et al. The virion glycoproteins of Ebola viruses are 
encoded in two reading frames and are expressed through 
transcriptional editing. Proceedings of the National Academy of 
Sciences. April 1996, Vol. 93, pages 3602-3607, especially page 
3604. 

FELGNER et al. Lipofection: A highly efficient, lipid-mediated 
DNA-transfection procedure. Proceedings of the National Academy 
of Sciences. November 1987, Vol. 84, pages 7413-7417, especially 
page 7414. 



1-20 



3, 12 



1-20 



Further documents are listed m the continuation of Box C Q See patent family annex. 



"E* 
•L- 



■o- 

-p. 



Special categories of cited documents 

document defining the generai state of the an which is not considered 
to be of particular relevance 

earlier document published on or after the international filing date 

document which may throw doubts on priority claim (s J or which is 
cited to establish the publication dale of another citation or other 
special reason (as specified) 

document referring to an oral disclosure, use. exhibition or other 
means 

document published prior to the international filing aate but later than 
the priority date claimed 



later document published after the international filing date or priority 
dale and not m conflict with the application but cited to understand 
the principle or theory underlying the invention 

document of particular relevance, the claimed invention cannot be 
considered novel or cannot be considered to involve an inventive ttep 
when the document is taken alone 

document of particular relevance, the claimed invention cannot be 
considered to involve an inventive step when the document is 
combined with one or more other such documents, such combination 
being obvious to a person skilled in the art 



document member of the same patent family 



Date of the actual completion of the .-itemational search 



12 APRIL 1999 



Name and mailing address of the ISA/US 
Commissioner of Patents and Trademarks 
Box PCT 

Washington, D.C. 20231 
Facsimile No. 



(703) 305-3230 



Form PC171SA/210 (second sheetXJuly 1992)* 



Date of maihng of the international search report 

11 MAY ~~ 



Authorized officer 
0AVID 



Telephone No. (703) 308-0196 



INTERNATIONAL SEARCH REPORT 


International application No. 

PCT/US99/013S2 


C (Continuation). DOCUMENTS CONSIDERED TO BE RELEVANT 


Category* 


Citation of document, with indication, where appropriate, of the relevant passages 


Relevant to claim No. 


A 


VOLCHKOV et al. GP mRNA of Ebola virus is edited by the 
Ebola virus polymerase and by T7 and vaccinia virus polymerases. 
Virology. 1995, Vol. 214, pages 421-430, especially page 424. 


3, 12 



Form PCT/1SA/210 (continuation of second sheetX-fuly 1992)* 



INTERNATIONAL SEARCH REPORT 



International application No. 
PCT/US99/01382 



A. CLASSIFICATION OF SUBJECT MATTER: 
US CL : 



424/450, 93.2. 93.21; 536/23.1, 23.72; 435/5, 6, 455, 457, 458, 320.1 



Form PCT/TSA/210 (extra sheetXJuly 1992}* 



Cell Type: 



Xi 

£ 

z 



Figure 1A 



Lymphocyte 



Control 



: / sGP 



1/11 



Figure 1A1 

Monocyte 



Control 



sGP 



iroi / 

X J^1 



Fluorescence Intensity 



09/600766 



Figure 1A2 

Granulocyte 



Control 

\ 



sGP 



lit 



Antibody: 
Ebola Protein: 



Control 



o 




10 10 10* 



CD15 



Figure 1B 



<x-CD15 
sGP 




10 10 in 



CD15 



Figure 1B1 



Supernatant: Control sGP 



w 0> 

3 s 

& o 



0> 



Absorption: 



z S z z S z 



»« 



69 KDa 



— 57 KDa 



1 2 3 4 5 6 



Figure 1C 



2/11 



09/600766 



Cell: HeLa D17 BW5147 




Figure 2A 




Figure 2B 



09/600766 




09/600766 



4/11 



Ab: 



Figure 3A 

Control (Ig) 




Figure 3B 
a-CD16 




Ab: 



£ 
s 

u 



Figure 3C 

Control (Ig) 




Figure 3D 

CC-CD62L 




Stimulation: 



Figure 3E 

None 




Figure 3F 

PMA 




Fluorescence Intensity 



5/11 




Jaqran^ IPO 



09?60e?66"- 

6/11 




1111103 



09/600766 

7/11 



Aw I (6269) 
/find m (6145) N 



Dram(6Z0l) 
Xkol(666S) 
C£zl(6574) 



Ate I (185) 



JVcfe I (571) 
Ncol(697) 

Sac H (992) 
' ,-5*1(1092) 



Xmn I (4570) 




Hga I (1755) 

f^\AtoI(1848) 
\S>C\P.srI(1865) 

\\>XSWI(1875) 
\\\/W 1(1882) 
\\5c/I(1886) 
\£coRV(1894) 
£coRI(1918) 



^7«I(4253) / 
I (4225) 

BarriH. I (4012)// ' 
Nar I (4006)'// 
/3ul(4005) / 
£cc-RI(3992) 

5c/ 1 (3807) 



\ 5amHI(2560) 
A7k> I (2669) 



5c/ 1 (3090) 



£amHI(3421) 



Figure 6 



09/600766 



8/11 



Drum (6871) 
Cla I (6644) 



Pvul(6339) 
/find m (6215) \ 



Mfel(185) 
/ Bal I (248) 
// Nde I (571) 
' NcoI(6?7) 



Sac H (992) 
\$pftl(1092) 



Am/: I (4640) 

Ap«I(4323) 
i5pAI(4295) 
£amH I (4082) 




//co I (1848) 
S^^_5a/I(1875) 
g^N^-J^j/ 1 (1882) 

\\SpH(1921) 
\£coRV(1947) 
Nco I (1977) 

£coRV(2212) 



■ /.• n 

Narl(4076j//// 
Kas I (4075) // 
Xbal(A06G) // 
BamH. I (4056) / 
.BamH I (4044) / / 
&7mHI(3929) / 
-BamH I (3782) 



Afco I (3224) 



-BamH I (3569) 



Figure 7 



09/600766 

9/11 



DraTR (7083) 
Xhol(6941) \ Atfel(185) 
^1(6551) \ \ / Ndel(51V) 

Hind m (6427) L^^^"^"^^^^^^ J*°° 1 ( 69 ?) 



Sac II (992) 
.^1(1092) 



Sph I (4507) 




I (1755) 

A^I (1848) 

|>vPm/I(1882) 
^\£a>RV(1894) 

y^iVb?I(1899) 

\\JK>aI(1906) 
\ -Kay I (19 15) 
Nor I (1916) 



£coRV(2730) 



Figure 8 



09/600766 

10/11 



Dra m (7070) 




Figure 9 




Atty Docket No.: UMV-1474IIS 

DECLARATION, PETITION AND POWER OF ATTORNEY 
FOR PATENT APPLICATION 

(Check one): 

□ Declaration Submitted with Initial Filing 
13 Declaration Submitted after Initial Filing 

As a below named inventor, I hereby declare that: 

My residence, post office address and citizenship are as stated below next to my name, 

I believe I am the original, first and sole inventor (if only one name is listed below) or an 
original, first and joint inventor (if plural names are listed below) of the subject matter which 
is claimed and for which a patent is sought on the invention entitled: 

TARGETING GENE TRANSFER VECTORS TO CERTAIN CELL 
TYPES BY PSEUDOTYPING WITH VIRAL GLYCOPROTEIN 



the specification of which (check one): 

□ is attached hereto. 

OR 

M was filed on 21 January 1999 as PCT International Application Number 
PCT/US99/01382 

□ and was amended by PCT Article 1 9 Amendment on 

(if applicable), 

□ and was amended by PCT Article 34 Amendment on 

(if applicable). 



I acknowledge the duty to disclose to the Office all information known to me to be material 
to patentability as defined in Title 37, Code of Federal Regulations, §1.56. 

I hereby state that I have reviewed and understood the contents of the above-identified 
specification, including the claims, as amended by any amendment referred to above. 



PRIORITY CLAIM 



(Check one): 

□ no such applications have been filed. 
SI such applications have been filed as follows 

1) FOREIGN PRIORITY CLAIM : I hereby claim foreign priority benefits under Title 35, United 
States Code, § 1 19(a)-(d) or §365(b) of any foreign application(s) for patent or inventor's certificate or 
§3 65 (a) of any PCT international application which designated at least one country other than the 
United States of America, listed below and have also identified below, by checking the box, any foreign 
application for patent or inventor's certificate or any PCT international application having a filing date 
before that of the application on which priority is claimed. 



Prior Foreign 
Application 
Number(s) 


Country 


Foreign Filing 

Date 
(dd/mm/yyyy) 


Priority 
Not Claimed 


Certified Copy 
Attached 
Yes No 








□ 


□ 








□ 


□ 



□ Additional foreign application numbers are listed on a supplemental priority sheet attached hereto. 



2) PROVISIONAL PRIORITY CLAIM : I hereby claim the benefit under Title 35, United States 
Code § 1 19(e) of any United States provisional application(s) listed below. 



Provisional Application Numbers') 


Filing Date (dd/mm/yyyy) 


60/072,033 


21 January 1998 (21.01.98) 







□ Additional provisional application numbers are listed on a supplemental priority sheet attached 
hereto. 



3) U.S./PCT PRIORITY CLAIM : I hereby claim the benefit under Title 35, United States Code, 
§120 of any United States application or §365(c) of any PCT international application designating the 
United States of America, listed below and, insofar as the subject matter of each of the claims of this 
application is not disclosed in the prior United States or PCT international application in the manner 
provided by the first paragraph of Title 35, United States Code, §1 12, 1 acknowledge the duty to 
disclose information which is known to me to be material to patentability as defined in Title 37, Code 
of Federal Regulations, §1 .56 which became available between the filing date of the prior application 
and the national or PCT international filing date of this application. 



U.S. Parent Application 
Number 


PCT Parent Number 


Parent Filing Date 
(dd/mm/yyyy) 


Parent Patent Number 
(if applicable) 




PCT/US99/01382 


21 January 1999 
(21.01.99) 













□ Additional U.S, or PCT international application numbers are listed on a supplemental priority 



Atty Docket No.: UMV-1474US 

DECLARATION, PETITION AND POWER OF ATTORNEY 
FOR PATENT APPLICATION 

(Check one): 

□ Declaration Submitted with Initial Filing 
13 Declaration Submitted after Initial Filing 

As a below named inventor, I hereby declare that: 

My residence, post office address and citizenship are as stated below next to my name, 

I believe I am the original, first and sole inventor (if only one name is listed below) or an 
original, first and joint inventor (if plural names are listed below) of the subject matter which 
is claimed and for which a patent is sought on the invention entitled: 

TARGETING GENE TRANSFER VECTORS TO CERTAIN CELL 
TYPES BY PSEUDOTYPING WITH VIRAL GLYCOPROTEIN 



the specification of which (check one): 

□ is attached hereto. 
OR 

13 was filed on 21 January 1999 as PCT International Application Number 
PCT/US99/01382 



□ and was amended by PCT Article 1 9 Amendment on 

(if applicable), 

□ and was amended by PCT Article 34 Amendment on 

(if applicable). 

I acknowledge the duty to disclose to the Office all information known to me to be material 
to patentability as defined in Title 37, Code of Federal Regulations, §1.56. 

I hereby state that I have reviewed and understood the contents of the above-identified 
specification, including the claims, as amended by any amendment referred to above. 




-2- 



PRIORITY CLAIM 



(Check one): 

□ no such applications have been filed. 
E such applications have been filed as follows 

!) FOREIGN PRIORITY CLATM; I hereby claim foreign priority benefits under Title 35, United 
States Code, §1 19(a)-(d) or §365(b) of any foreign application(s) for patent or inventor's certificate or 
§3 65(a) of any PCT international application which designated at least one country other than the 
United States of America, listed below and have also identified below, by checking the box, any foreie 
application for patent or inventor's certificate or any PCT international application having a' filing date 
before that of the application on which priority is claimed. 



Prior Foreign 
Application 
Number(s) 


Country 


Foreign Filing 

Date 
(dd/mm/yyyy) 


Priority 
Not Claimed 


Certified Copy 
Attached 
Yes No 








□ 


□ 








□ 


□ b 



Additional foreign application numbers are listed on a supplemental priority sheet attached hereto. 



2 ) PROVISIONAL PRIORITY CLATM: I hereby claim the benefit under Title 35, United States 
Code §1 19(e) of any United States provisional application(s) listed below. 



Provisional Application Number(s) 


Filing Date (dd/mm/yyyy) 


60/072,033 


21 January 1998 (21.01.98) 







□ Additional provisional application numbers are listed on a supplemental priority sheet attached 
hereto. 



3 > U S./PCT PRIORITY CLATM : I hereby claim the benefit under Title 35, United States Code 
§120 of any United States application or §365(c) of any PCT international application designating the 
United States of America, listed below and, insofar as the subject matter of each of the claims of this 
application is not disclosed in the prior United States or PCT international application in the manner 
provided by the first paragraph of Title 35, United States Code, § 1 12, 1 acknowledge the duty to 
disclose information which is known to me to be material to patentability as defined in Title 37 Code 
of Federal Regulations, §1.56 which became available between the filing date of the prior application 
and the national or PCT international filing date of this application. 



U.S. Parent Application 
Number 


PCT Parent Number 


Parent Filing Date 
(dd/mm/yyyy) 


Parent Patent Number 
(if applicable) 




PCT/US99/01382 


21 January 1999 
(21.01.99) 













□ Additional U.S. or PCT international application numbers are listed on a supplemental priority 




POWER OF ATTORNEY: 



As a named inventor, I hereby appoint the following attorneys and/or agents to prosecute this 
application and transact all business in the Patent and Trademark Office connected therewith. 



James E. Cockfield 
Thomas V. Smurzynski 
Ralph A. Loren 
Giulio A. DeConti, Jr. 

nn Lamport Hammitte 
Elizabeth A. Hanley 
Amy E. Mandragouras 
Anthony A. Laurentano 
Jane E. Remillard 
Jeremiah Lynch 
Kevin J. Canning 
David A. Lane } Jr. 
Jeanne M. DiGiorgio 



Reg. No .J9J62 
Reg. Nc x 24.798 
Reg. Nq^2<U25 
Reg. No^2UQ3 
Reg. NO.J4S58 
Reg. No.jX&H 
Reg. N( x_36,207 
Reg. No.i^22a 
Reg. No. 38,872 
Reg. No. VT425 
Reg. No.^35,470 
Reg. No. 39,261 
Reg. No.JJ^WO- 



Megan E. Williams 
Nicholas P. Triano III 
Peter C. Lauro 
DeAnn F. Smith 
William D. DeVaul 
David J. Rikkers 
Chi Suk Kim 
Maria Laccotripe Zacharakis 

Debra J. Milasincic 
David R. Burns 



Reg. No. 43,270 
Reg. No. ,36337 
Reg. No. 32.360 
Reg. No. 36.683 
Reg. No. ^2,433 
Reg. No. 
Reg. No. 42*228 
Limited Recognition 
Under 37 C.F.R. § 10.9(b) 
Reg. No. P46J21L. 
Reg. No. P46,590 



Send Correspondence to: 

DeAnn F 1 Srnit h, Lahive & C ockfi eldjj^ 28State Street. Boston. Massachusetts 02109, 
> United States of America^ ~~~ — 

Direct Telephone Calls to: (name and telephone number) 
DeAnn F. Smith. T617^ 227-7400 

Wherefore I petition that letters patent be granted to me for the invention or discovery described and 
claimed in the attached specification and claims, and hereby subscribe my name to said specification 
and claims and to the foregoing declaration, power of attorney, and this petition. 

I hereby declare that all statements made herein of my own knowledge are true and that all 
statements made on information and belief are believed to be true; and further that these statements 
were made with the knowledge that willful false statements and the like so made are punishable by 
fine or imprisonment, or both, under Section 1001 of Title 18 of the United States Code and that 
such willful false statements may jeopardize the validity of the application or any patent issued 
thereon. 



Full name of sole or first inventor 




Gary J. NABEL 




Inventor's signature 


Date 


Residence 




2520 30th Street, N.W., Washington, D.C. 20008 




Citizenship 




United States of America 





POWER OF ATTORNEY: 



. — 



As a named inventor, I hereby appoint the following attorneys and/or agents to prosecute this 
application and transact all business in the Patent and Trademark Office connected therewith. 



James E. Cockfield 
Thomas V. Smurzynski 
Ralph A. Loren 
Giulio A. DeConti, Jr. 
Ann Lamport Hammitte 
Elizabeth A. Hanley 
Amy E. Mandragouras 
Anthony A. Laurentano 
Jane E. Remillard 
Jeremiah Lynch 
Kevin J. Canning 
David A. Lane, Jr. 
Jeanne M. DiGiorgio 



Reg. No. \9A62 
Reg. Nn-?,4,7.fl8 
Reg. No. 29,325 
Reg. No._2i*5Ql 
Reg. No ^34,858 
Reg. NoJ33,505_ 
Reg. No. 36,207 
Reg. No. lggZir 
Reg. No. 38,872^ 
Reg. No J J7 8 425_ 
Reg. No. M^2£L 
Reg. No^24L 
Reg. No. 41,710 



Megan E. Williams 
Nicholas P. Triano III 
Peter C. Lauro 
DeAnn F. Smith 
William D. DeVaul 
David J. Rikkers 
Chi Suk Kim 
Maria Laccotripe Zacharakis 

Debra J, Milasincic 
David R. Burns 



Reg. No. 43,270 
Reg. No. JS6,397 
Reg. No.J2 B lfi0 
Reg. No . 36,683 
Reg. No. 42.483 
Reg. No. 43,88 2 
Reg. No. j42.728_ 
Limited Recognition 
Under 37 C.RR. § 10.9(b) 
Reg. No. P46 jm 
Reg. No. P46,590 



Send Correspondence to: 

DeAnn F, Smith, Lahive^Cockfi eld. LLP. 28 State Street. BostonJVlasj^ m^tts 021 09. 
United States of America 

Direct Telephone Calls to: (name and telephone number) 
DeAnn F. Smith. (617) 227-7400 

Wherefore I petition that letters patent be granted to me for the invention or discovery described and 
claimed in the attached specification and claims, and hereby subscribe my name to said specification 
and claims and to the foregoing declaration, power of attorney, and this petition. 

I hereby declare that all statements made herein of my own knowledge are true and that all 
statements made on information and belief are believed to be true; and further that these statements 
were made with the knowledge that willful false statements and the like so made are punishable by 
fine or imprisonment, or both, under Section 1001 of Title 18 of the United States Code and that 
such willful false statements may jeopardize the validity of the application or any patent issued 
thereon. 
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Citizenship 
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Citizenship 
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Zhi-yong YANG 
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Citizenship 

China 
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